FF7/Battle/Battle Animation (PC)

From Final Fantasy Inside
< FF7‎ | Battle
Revision as of 22:03, 7 December 2012 by my_wiki>Borde
Jump to navigation Jump to search

Battle Animation File Format

File format discovered/decoded by me, L. Spiro. This file was written by me, L. Spiro, as can be seen by the proper grammar and perfect spelling.

(Wiki-fied by Halkun)

Part I: Structures

There are 4 basic structures we will use in decoding the file format. The header of animation data has been considered to be composed of 3 DWORD’s, 3 WORD’s, and one BYTE, however this is not how the header is really intended to be, despite being aligned correctly.

Battle animation files start with a DWORD which tells us how many animations are in the file. This number includes the special animations which are not actually animations at all. Although I have not yet decoded them, I suspect these are sets of keys for actual animation sets; keys that call scripted actions or tell the engine to print the damage numbers.

FF7FrameHeader

Cloud’s battle animation file (rtda) has 94 (0x 5E) animations in it. After this number begins each animation. Each animation begins with a 12-byte header (3 DWORD’s) we will call "FF7FrameHeader". To get from one animation to the next, start at offset 0x04 in the animation file and begin reading these headers. For each header, skip "FF7FrameHeader.dwChunkSize" bytes until you get to the index of the file you want to load. When skipping, remember to skip starting at the end of the "FF7FrameHeader" header. I mentioned a type of special animation data set that is in the header file. These data sets, when filled with the "FF7FrameHeader" header, will have a "dwChunkSize" less than eleven, we skip them by jumping over the next 8 bytes that follow.


First, the two main headers in the animation file.

1.

typedef struct FF7FrameHeader {
	DWORD		dwBones;		// Bones in the model +	1 (unless we're dealing with a weapon animation, in which case it's value is always 1).	0x00
                                                // For some single bone models dwBones = 1... but for others it's 2. So I guess this field should be taken with a grain of salt.
	DWORD		dwFrames;		// Frames in the animation.	0x04
	DWORD		dwChunkSize;	// Size of the animation set.	0x08
} * PFF7FrameHeader;			// Size = 12 bytes.

2.

typedef struct FF7FrameMiniHeader {
	//SHORT		sBones;	// Bones in the animation.
	SHORT		sFrames;// Apparently, frames in the animation (but sometimes sFrames > dwFrames)
	SHORT		sSize;	// Size of the animation data.	0x02
	BYTE		bKey;		// A key flag used for decoding.	0x04
} * PFF7FrameMiniHeader;	// Size = 5 bytes.

NOTE: sBones should provably be called sFrames since it seems to hold a secondary frames counter. Thus, it should be equal to dwFrames. Unfortunately, it usually isn't. In fact, It's hard to say which one should actually be trusted. Apparently dwFrames is a more conservative value, meaning there will always be at least that many frames in the animation. But there can be more of them. I can't help but wonder if this means the rest of the frames are dummied out information or they serve some sort of purpose. On the other hand, sFrames is sometimes higher than the actual number of frames on the animation chunck.

Anyway, the actual number of frames can be computed by parsing the whole animation chunck.

It's also worth mentioning that there is at least one animation (15th from RSAA, the playable frog) which physically lacks the sFrames field. Instead, sSize is at 0x00 and bKey at 0x02. This animation is more than likely damaged, because FF7 doesn't seems to be able to handle it.

FF7FrameMiniHeader

Most of these members are straight-forward, however there is a very special and VERY important member in the "FF7FrameMiniHeader" structure called "bKey". This is used for every rotation-decoding scheme (but one). It determines, essentially, the precision of the rotations and the deltas that follow in successive frames. The value of “bKey” can only be 0, 2, or 4; the equation "(12 - bKey)" is used to determine the length of each raw (uncompressed) rotation. After decompression, every rotation must be 12 bits, giving it a range from 0 to 4095. But if "bKey" is 4, for example, then that means uncompressed rotations are stored as 8 bits, which gives them a range from 0 to 255. How is this fixed? After the 8 bits are read, they are then shifted left (up) by "bKey". This will place them at 12 bits, but with decreased accuracy. This loss in accuracy is acceptable since rotations work as deltas and usually only change by a small amount. Most large rotation deltas are things that are spinning, such as the blades on Aero Combatant. These cases are always a nice round number that can be handled with lower precision (in the case of Aero Combatant, it is 90 degrees even).


FF7ShortVec

Now the code to skip to any animation, by index, where "iTarget" is the index. This code assumes you have already opened the animation file (hFile) and you have skipped pasted the first 4 bytes.

FF7FrameHeader	fhHeader;
DWORD			dwBytesRead;
for ( int I = 0; I < iTarget; I++ ) {
	if ( !ReadFile( hFile,
		&fhHeader, sizeof( fhHeader ),
		&dwBytesRead, NULL ) ) {

	CloseHandle( hFile );
	return false;
	}
	if ( fhHeader.dwChunkSize < 11 ) {	// If this is a special
						//	chunk, skip it (it
						//	is counted as part
						//	of the total in the
						//	file).
		if ( SetFilePointer( hFile, 8, NULL, FILE_CURRENT ) ==
		INVALID_SET_FILE_POINTER ) {
			CloseHandle( hFile );
			return false;
		}
		continue;			// Go on to the next
						//	animation.
	}
	// Skip this animation set, whose size is determined by
	//	fhHeader.dwChunkSize.
	if ( SetFilePointer( hFile, fhHeader.dwChunkSize, NULL,
	FILE_CURRENT ) == INVALID_SET_FILE_POINTER ) {
		CloseHandle( hFile );
		return false;
	}
}
// Once we come to this point, we are at the very first byte of the
//	animation we want to load.  Let’s store it into a BYTE array.
if ( !ReadFile( hFile,
	&fhHeader, sizeof( fhHeader ),
	&dwBytesRead, NULL ) ) {

	CloseHandle( hFile );
	return false;
}
BYTE * pbBuffer = new BYTE[fhHeader.dwChunkSize];
// Now pbBuffer holds the actual animation data, including the 5-byte
//	“FF7FrameMiniHeader” header.

We now have the animation we want loaded into a BYTE array (remember to delete it later).

FF7FrameBuffer

Now let’s look at the other structures we will use.

3.

typedef struct FF7ShortVec {
	SHORT	sX, sY, sZ;		// Signed short versions.	0x00
	INT	iX, iY, iZ;		// Integer representation.	0x06
	FLOAT	fX, fY, fZ;		// Float version after math.	0x12
} * PFF7ShortRot;			// Size = 30 bytes.

Each rotation goes through 3 forms. Firstly, everything is stored as 2-byte SHORT’s. These SHORT’s are stored from 0 to 4096, where 0 = 0 degrees and 4096 = 360 degrees. This is the equation to convert one of these SHORT’s into degrees: (SHORT / 4096 * 360). Each frame is based off the previous frame, using the SHORT value as its basis. Each SHORT is converted to an INT, which is the exact same as the SHORT version, except always positive. Finally, the FLOAT gets filled with the final value, using the INT version as its base. So, the sequence is:

First frame…

  • Read X bits and store as a signed SHORT.
  • Convert the SHORT to the INT field, adding 0x1000 if negative.
  • Convert to FLOAT using (INT / 4096 * 360). Apply this FLOAT to your model.

Next frame…

  • Read X bits, and add them to the SHORT value from last frame.
  • Convert the SHORT to the INT field, adding 0x1000 if negative.
  • Convert to FLOAT using (INT / 4096 * 360). Apply this FLOAT to your model.

Repeat…


This structure is for one bone rotation. To load an entire frame’s work of bones, we need this structure: 4.

typedef struct FF7FrameBuffer {
	DWORD			dwBones;
	FF7ShortVec		svPosOffset;
	FF7ShortVec		*psvRots;

	FF7FrameBuffer() {
		dwBones = 0;
		psvRots = NULL;
	}
	~FF7FrameBuffer() {
		dwBones = 0;
		delete [] psvRots;
		psvRots = NULL;
	}

	VOID	SetBones( DWORD dwTotal ) {
		// Delete the old.
		dwBones = 0;
		delete [] psvRots;
		
		// Create the new.
		psvRots = new FF7ShortVec[dwTotal];
		if ( psvRots != NULL ) { dwBones = dwTotal; }
	}
} * PFF7FrameBuffer;

This structure will allocate enough memory for one frame of rotations. Simply call “FF7FrameBuffer.SetBones” with the number of bones in your animation.



Part II: Functions and Format

First, we need a way to read bits from the BYTE array we have stored. This is a basic bit-reading function. It reads “dwTotalBits” from “pbBuffer” starting at the “dwStartBit”’th bit.

GetBitsFixed

1.

INT GetBitsFixed( BYTE * pbBuffer, DWORD &dwStartBit,
DWORD dwTotalBits ) {
	INT iReturn = 0;

	for ( DWORD I = 0; I < dwTotalBits; I++ ) {
		iReturn <<= 1;

		__asm mov eax, dwStartBit
		__asm mov eax, [eax]
		__asm cdq
		__asm and edx, 7
		__asm add eax, edx
		__asm sar eax, 3
		__asm mov ecx, pbBuffer
		__asm xor edx, edx
		__asm mov dl, byte ptr ds:[ecx+eax]
		__asm mov eax, dwStartBit
		__asm mov ecx, [eax]
		__asm and ecx, 7
		__asm mov eax, 7
		__asm sub eax, ecx
		__asm mov esi, 1
		__asm mov ecx, eax
		__asm shl esi, cl
		__asm and edx, esi
		__asm test edx, edx
		__asm je INCBIT
		iReturn++;
INCBIT:	dwStartBit++;
	}

	// Force the sign bit to extend across the 32-bit boundary.
	iReturn <<= (0x20 - dwTotalBits);
	iReturn >>= (0x20 - dwTotalBits);
	return iReturn;
}

Now that we can read the bits in the buffer we have made, it’s time to know what we’re doing!

A.

The animation data begins with one full frame that is uncompressed, but stored in one of 3 ways. Every frame after that is compressed, but compressed in one of three ways; one way can be decoded using the same method as on the first frame, which is why sometimes the second, third, and even fourth frames can be decoded using the same method as was used on the first frame. First Frame: Remember that we stored our animation buffer with a 5-byte “FF7FrameMiniHeader” at the beginning of it? We need this header now!

PFF7FrameMiniHeader pfmhMiniHeader = (PFF7FrameMiniHeader)pbBuffer;

After this cast, “pfmhMiniHeader->bKey” will contain a number, either 0, 2, or 4. Each rotation is stored in (12 - “pfmhMiniHeader->bKey”) bits. This mean either 12, 10, or 8, respectively. This is important to know. But first, there is offset data. Each offset is 16 bits (a signed SHORT). In the first frame of Cloud’s first animation (rtda), these bytes are 00 00 FE 2E 00 00. 16 bits × 3 = 48 bits, or 6 bytes. To get these bits, we first need to make a pointer point to the correct location. “pbBuffer” points 5 bytes before this data, so let’s make a pointer that points to this data directly.

BYTE * pbAnimBuffer = &pbBuffer[5];

When we use “GetBitsFixed()” to get the bits.

DWORD dwBitStart = 0;	// The bits at which to begin reading in the
//	stream.
SHORT sX = GetBitsFixed( pbAnimBuffer, dwBitStart, 16 );
SHORT sY = GetBitsFixed( pbAnimBuffer, dwBitStart, 16 );
SHORT sZ = GetBitsFixed( pbAnimBuffer, dwBitStart, 16 );

After doing this, we have each of the three offsets, 0, -466, and 0. The Y (-466) is always stored as its inverse, but for now we don’t worry about that.

The first frame is uncompressed, but it could be 12, 10, or 8 bits per rotation. How do we know? “pfmhMiniHeader->bKey”!

For each bone, there are 3 rotations. So, for each bone, we do this:

SHORT sRotX = GetBitsFixed( pbAnimBuffer, dwBitStart,
12 - pfmhMiniHeader->bKey );
SHORT sRotY = GetBitsFixed( pbAnimBuffer, dwBitStart,
12 - pfmhMiniHeader->bKey );
SHORT sRotZ = GetBitsFixed( pbAnimBuffer, dwBitStart,
12 - pfmhMiniHeader->bKey );
// We have each rotation, but for the equation to work, the range
//	must always be from 0 to 4095.  If we got 8 bytes, for example,
//	the range would only be from 0 to 255, so here we need to fix
//	this.
sRotX <<= pfmhMiniHeader->bKey;
sRotY <<= pfmhMiniHeader->bKey;
sRotZ <<= pfmhMiniHeader->bKey;

The first rotation is always 0, 0, 0. This is the root rotation and is not actually counted as part of the bone network of the character.


GetDynamicFrameOffsetBits

The first frame is easy. Remember that all frames after are stored as relative offsets from the frame before it. The offsets are relative to the SHORT values of the previous frame rather than the FLOAT or INT values.

Each frame begins with the three position offset values, but in the second frame and after, they can be either 7 or 16 bits. To determine if which they are, we first get one bit. If that bit is 0, then the following 7 bits are the actual value of the offset (signed). If it is 1, then the next 16 bits are the value of the offset. In total, the offsets will be either 8 or 17 bits.

Now the code to perform this operation.

2.

SHORT GetDynamicFrameOffsetBits( BYTE * pBuffer, DWORD &dwBitStart ) {
DWORD dwFirstByte, dwConsumedBits, dwBitsRemainingToNextByte, dwTemp;
	SHORT sReturn;
	__asm {
		mov eax, dwBitStart
		mov eax, [eax]
		cdq
		and edx, 7
		add eax, edx
		sar eax, 3
		mov dwFirstByte, eax
		mov ecx, dwBitStart
		mov edx, [ecx]
		and edx, 7
		mov dwConsumedBits, edx
		mov eax, 7
		sub eax, dwConsumedBits
		mov dwBitsRemainingToNextByte, eax
		mov ecx, pBuffer
		add ecx, dwFirstByte		// Go to the first byte that
						// has the bit where we
						// want to begin.
		xor edx, edx
		mov dl, byte ptr ds:[ecx]
		shl edx, 8
		mov eax, pBuffer
		add eax, dwFirstByte
		xor ecx, ecx
		mov cl, byte ptr ds:[eax+1]
		or edx, ecx
		mov dwTemp, edx
		mov ecx, dwBitsRemainingToNextByte
		add ecx, 8
		mov edx, 1
		shl edx, cl
		mov eax, dwTemp
		and eax, edx
		test eax, eax
		jnz SeventeenBits

EightBits :
		mov ecx, dwConsumedBits
		add ecx, 1
		mov edx, dwTemp
		shl edx, cl
		movsx eax, dx
		sar eax, 9
		mov sReturn, ax
		mov ecx, dwBitStart	//
		mov edx, [ecx]		//
		add edx, 8		//
		mov eax, dwBitStart	//
		mov [eax], edx		// Increase dwBitStart by 0x8 (8).
		jmp End

SeventeenBits :
		mov ecx, dwTemp
		shl ecx, 8
		mov edx, pBuffer
		add edx, dwFirstByte
		xor eax, eax
		mov al, byte ptr ds:[edx+2]
		or ecx, eax
		mov dwTemp, ecx
		mov ecx, dwConsumedBits
		add ecx, 1
		mov edx, dwTemp
		shl edx, cl
		shr edx, 8
		mov sReturn, dx
		mov eax, dwBitStart	//
		mov ecx, [eax]		//
		add ecx, 0x11		//
		mov edx, dwBitStart	//
		mov [edx], ecx		// Increase dwBitStart by 0x11
					//	(17).

End :
	}
	return sReturn;
}

After the first frame, we know that the positional offsets immediately follow. So to get the positional deltas for the next frame, we would do this:

SHORT sDeltaX = GetDynamicFrameOffsetBits( pbAnimBuffer, dwBitStart );
SHORT sDeltaY = GetDynamicFrameOffsetBits( pbAnimBuffer, dwBitStart );
SHORT sDeltaZ = GetDynamicFrameOffsetBits( pbAnimBuffer, dwBitStart );

Now we have the change from the previous frame. In our “FF7ShortVec” structure, these are the SHORT values. To get the position of this frame, we add these offsets to the last frame’s position. If “I” is this frame and “I-1” is the last frame, we could do something like this:

FF7FrameBuffer[I].svPosOffset.sX = FF7FrameBuffer[I-1].svPosOffset.sX + sX;
FF7FrameBuffer[I].svPosOffset.sY = FF7FrameBuffer[I-1].svPosOffset.sY + sY;
FF7FrameBuffer[I].svPosOffset.sZ = FF7FrameBuffer[I-1].svPosOffset.sZ + sZ;


GetEncryptedRotationBits

Now all that is left is to decode the rotations. Rotations change size in multiple ways. There is no single simple way to express them.

They are, however, always at least one bit long. The first bit is a flag. If 0, the rotational change is 0, and that is the end of that rotation. If it is not 0, then we must get the next 3 bits. The next 3 bits can tell us to do one of three things. If the resulting 3-bit signed value is 0, then the rotation delta is (-1 << pfmhMiniHeader->bKey). This is the smallest possible decrement for the given precision (remember that precision is based off “pfmhMiniHeader->bKey”. If the 3-bit value is 7, then we treat the rotation the same way as we do in the first frame, where we read (12-pfmhMiniHeader->bKey) bits, then shift left by “pfmhMiniHeader->bKey”.

The complicated cases are 1 through 6. If the 3-bit value is from 1 to 6, then this indicates the number of bits in the rotation delta. For our example, let’s assume the 3-bit value was 4. This means we need to read the next 4 bits from the stream. These 4 bits will be the animation delta, but we actually have to handle them before we can call it final. The first bit of this new data is a sign bit which determines if the value is below 0. If it is below zero, we must subtract from that number (1 << ([Number of Bits] – 1)). So, if the 3-bit value was 4, and we read 4 bits from the stream, and the resulting value was negative, we would subtract from that value (1 << 3), or 8. If the 4-bit value is positive, we add (1 << ([Number of Bits] – 1)) to it. After we handle the positive and negative cases, we have to adjust for our precision again. So, we shift left the resulting value by “pfmhMiniHeader->bKey”. This is all shown in the code below.

3.

SHORT GetEncryptedRotationBits( BYTE * pBuffer, DWORD &dwBitStart,
INT iKeyBits ) {
	DWORD dwNumBits, dwType;
	INT iTemp;
	SHORT sReturn;
	// Check the first bit.
	INT iBits = GetBitsFixed( pBuffer, dwBitStart, 1 );
	__asm mov eax, iBits	// If the first bit is 0, return 0
				// and continue. It is not necessary
				// to mov iBits into EAX, but I do it
				// anyway.
	__asm test eax, eax
	__asm jnz SecondTest
	__asm jmp ReturnZero	// Return 0

SecondTest :
	// Otherwise continue by getting the next 3 bits.
	iBits = GetBitsFixed( pBuffer, dwBitStart, 3 );
	__asm mov eax, iBits
	__asm and eax, 7
	__asm mov dwNumBits, eax
	__asm mov ecx, dwNumBits
	__asm mov dwType, ecx	// dwType = ecx = dwNumBits = eax =
				//	(iBits & 7).
				//	When we get to the case, all of
				//	these values are the same.
	__asm cmp dwType, 7
	__asm ja ReturnZero	// Is dwType above 7?  If so, return 0.
				//	This can never actually happen.

	// Otherwise, use it in a switch case.
	switch ( dwType ) {
		case 0 : {
			__asm or eax, 0xFFFFFFFF	// After this, EAX will
							// always be -1.
			__asm mov ecx, iKeyBits
			__asm shl eax, cl		// Shift left by
							// precision.
							// (-1 << iKeyBits)
			__asm mov sReturn, ax		// Return that number.
			__asm jmp End
		}
		case 1 : {}
		case 2 : {}
		case 3 : {}
		case 4 : {}
		case 5 : {}
		case 6 : {
			// Get a number of bits equal to the case switch (1,
			// 2, 3, 4, 5, or 6).
			iTemp = GetBitsFixed( pBuffer, dwBitStart,
				dwNumBits );
			__asm mov eax, iTemp
			__asm cmp iTemp, 0
			__asm jl IfLessThanZero
			// If greater than or equal to 0…
			__asm mov ecx, dwNumBits	// dwNumBits = (iBits &
							// 7) from before.
			__asm sub ecx, 1		// dwNumBits - 1.
			__asm mov eax, 1
			__asm shl eax, cl		// (1 << (dwNumBits –
							// 1)).
			__asm mov ecx, iTemp
			__asm add ecx, eax		// iTemp += (1 <<
							// (dwNumBits - 1)).
			__asm mov iTemp, ecx
			__asm jmp AfterTests
			// If less than 0…
IfLessThanZero :
			__asm mov ecx, dwNumBits	// dwNumBits = (iBits &
							// 7) from before.
			__asm sub ecx, 1		// Decrease it by 1.
			__asm mov edx, 1
			__asm shl edx, cl		// Shift “1” left by
							// (dwNumBits - 1).
			__asm mov eax, iTemp		// iTemp still has the
							// bits we read
							// from before.
			__asm sub eax, edx		// iTemp - (1 <<
							// (dwNumBits - 1))
			__asm mov iTemp, eax

			// Now, whatever we set on iTemp, we need to shift it
			// up by the precision value.
AfterTests :
			__asm mov eax, iTemp
			__asm mov ecx, iKeyBits
			__asm shl eax, cl			// iTemp <<= iKeyBits
			__asm mov sReturn, ax
			__asm jmp End
		}

		case 7 : {
			// Uncompressed bits.  Use standard decoding.
			iTemp = GetBitsFixed( pBuffer, dwBitStart,
				12 - iKeyBits );
			__asm mov ecx, iKeyBits
			__asm shl eax, cl			// iTemp <<= iKeyBits.
			__asm mov sReturn, ax
			__asm jmp End
		}

	}


ReturnZero :
	__asm xor ax, ax
	__asm mov sReturn, ax

End :

	return sReturn;
}


Part III: Putting it All Together

To make life easy, let’s use one function to load an entire frame at a time. This function will load an entire frame into a “FF7FrameBuffer” structure. The function will return the bit position where the next frame will begin. After the function returns, we must translate the rotational INT values to their FLOAT forms (although the function can be modified to do this part itself). This function will be called in a loop for every frame in the rotation.

LoadFrames

1.

DWORD LoadFrames( PFF7FrameBuffer pfbFrameBuffer,
INT iBones,
INT iBitStart,
BYTE * pbAnimBuffer ) {
	// Get backups of the information we need.
DWORD		dwThisBitStart	= iBitStart;
	INT		iThisBones		= iBones;
	BYTE *	pbThisBuffer	= pbAnimBuffer;

	PFF7FrameMiniHeader pfmhMiniHeader =
		(PFF7FrameMiniHeader)pbAnimBuffer;
	SHORT	sSize = pfmhMiniHeader->sSize;						
	BYTE	bKeyBits = pfmhMiniHeader->bKey;

	// Skip the first 5 bytes because they are part of the frame header.
	pbThisBuffer += sizeof( FF7FrameMiniHeader );

	if ( iBitStart == 0 ) {	// First frame?
		// The first frame is uncompressed and each value is the
		// actual rotation.
		pfbFrameBuffer->svPosOffset.sX = GetBitsFixed( pbThisBuffer, dwThisBitStart, 16 ); //Always 16 bits
		pfbFrameBuffer->svPosOffset.sY = GetBitsFixed( pbThisBuffer, dwThisBitStart, 16 );
		pfbFrameBuffer->svPosOffset.sZ = GetBitsFixed( pbThisBuffer, dwThisBitStart, 16 );

		// This function will set the FLOAT values for the
		//	positions.
		// Any scaling that needs to be done would be done here.
		pfbFrameBuffer->svPosOffset.fX = (FLOAT)pfbFrameBuffer->svPosOffset.sX;
		pfbFrameBuffer->svPosOffset.fY = (FLOAT)pfbFrameBuffer->svPosOffset.sY;
		pfbFrameBuffer->svPosOffset.fZ = (FLOAT)pfbFrameBuffer->svPosOffset.sZ;

		for ( int I = 0; I < iThisBones; I++ ) {
			// Now get each bone rotation (the first bone is
			//	actually the root, not part of the skeleton).
			// During the first frame, the rotations are always
			//	(12 – bKeyBits).
			// We shift by bKeyBits to align it to 12 bits.
			pfbFrameBuffer->psvRots[I].sX = (GetBitsFixed( pbThisBuffer, dwThisBitStart, 12 - bKeyBits ) << bKeyBits);
			pfbFrameBuffer->psvRots[I].sY = (GetBitsFixed( pbThisBuffer, dwThisBitStart, 12 - bKeyBits ) << bKeyBits);
			pfbFrameBuffer->psvRots[I].sZ = (GetBitsFixed( pbThisBuffer, dwThisBitStart, 12 - bKeyBits ) << bKeyBits);

			// Store the INT version as the absolute value
			//	of the SHORT version.
			pfbFrameBuffer->psvRots[I].iX = (pfbFrameBuffer->psvRots[I].sX < 0) ? pfbFrameBuffer->psvRots[I].sX + 0x1000 : pfbFrameBuffer->psvRots[I].sX;
			pfbFrameBuffer->psvRots[I].iY = (pfbFrameBuffer->psvRots[I].sY < 0) ? pfbFrameBuffer->psvRots[I].sY + 0x1000 : pfbFrameBuffer->psvRots[I].sY;
			pfbFrameBuffer->psvRots[I].iZ = (pfbFrameBuffer->psvRots[I].sZ < 0) ? pfbFrameBuffer->psvRots[I].sZ + 0x1000 : pfbFrameBuffer->psvRots[I].sZ;
		}
	}
	else {					// All other frames.
		SHORT sX, sY, sZ;			// Get the positional
//	offsets.
		sX = GetDynamicFrameOffsetBits( pbThisBuffer, dwThisBitStart );
		sY = GetDynamicFrameOffsetBits( pbThisBuffer, dwThisBitStart );
		sZ = GetDynamicFrameOffsetBits( pbThisBuffer, dwThisBitStart );

		// When we come to this area of the function,
//	pfbFrameBuffer will have the previous frame
//	still stored in it.  Just add the offsets.
		pfbFrameBuffer->svPosOffset.sX += sX;
		pfbFrameBuffer->svPosOffset.sY += sY;
		pfbFrameBuffer->svPosOffset.sZ += sZ;

		pfbFrameBuffer->svPosOffset.fX = (FLOAT)pfbFrameBuffer->svPosOffset.sX;
		pfbFrameBuffer->svPosOffset.fY = (FLOAT)pfbFrameBuffer->svPosOffset.sY;
		pfbFrameBuffer->svPosOffset.fZ = (FLOAT)pfbFrameBuffer->svPosOffset.sZ;
		for ( int I = 0; I < iThisBones; I++ ) {
			// The same applies here.  Add the offsets
			//	and convert to INT form, adding 0x1000
			//	if it is less than 0.
			// When Final Fantasy® VII loads these animations,
			//	it is possible for the value to sneak up above
			//	the 4095 boundary through a series of positive
			//	offsets. 
			sX = GetEncryptedRotationBits( pbThisBuffer, dwThisBitStart, bKeyBits );
			sY = GetEncryptedRotationBits( pbThisBuffer, dwThisBitStart, bKeyBits );
			sZ = GetEncryptedRotationBits( pbThisBuffer, dwThisBitStart, bKeyBits );

			pfbFrameBuffer->psvRots[I].sX += sX;
			pfbFrameBuffer->psvRots[I].sY += sY;
			pfbFrameBuffer->psvRots[I].sZ += sZ;

			pfbFrameBuffer->psvRots[I].iX = (pfbFrameBuffer->psvRots[I].sX < 0) ? pfbFrameBuffer->psvRots[I].sX + 0x1000 : pfbFrameBuffer->psvRots[I].sX;
			pfbFrameBuffer->psvRots[I].iY = (pfbFrameBuffer->psvRots[I].sY < 0) ? pfbFrameBuffer->psvRots[I].sY + 0x1000 : pfbFrameBuffer->psvRots[I].sY;
			pfbFrameBuffer->psvRots[I].iZ = (pfbFrameBuffer->psvRots[I].sZ < 0) ? pfbFrameBuffer->psvRots[I].sZ + 0x1000 : pfbFrameBuffer->psvRots[I].sZ;
		}
	}

	// If we did not read as many bits as there are in the frame,
	//	return the location where the bits should start for the
	//	next frame.
	if ( (SHORT)(dwThisBitStart / 8) < sSize ) {
return dwThisBitStart;
}
	// Otherwise, return 0.
	return 0;
}

2.

A Sample Loop

This is an example loop that could be used to load a full animation.

FF7FrameHeader fhHeader;
ReadFile( hFile, &fhHeader, sizeof( fhHeader ), &ulBytesRead,
NULL );
BYTE * baData = new BYTE[fhHeader.dwChunkSize];
ReadFile( hFile, &baData, fhHeader.dwChunkSize, &ulBytesRead,
NULL );

	// This will be our buffer to hold one frame.
	//	We will only buffer one frame at a time, so to
	//	to fully load the animations, you would need to
	//	write your own routine to store the data in
	//	fbFrameBuffer after each loaded frame.
FF7FrameBuffer fbFrameBuffer;
fbFrameBuffer.SetBones( fhHeader.dwBones );
INT iBits = 0;
for ( DWORD J = 0; J < fhHeader.dwFrames; J++ ) {
	// We pass a pointer to fbFrameBuffer.  The first frame
	//	will load diretly into it.
	// Every frame after that will actually use it with the
	//	offsets loaded to determine the final result of that
	//	frame. 
		iBits = LoadFrames( &fbFrameBuffer, fhHeader.dwBones, iBits, baData );
		// Reverse the Y offset (required).
		fbFrameBuffer.svPosOffset.fY = 0.0f – fbFrameBuffer.svPosOffset.fY;

// The first rotation set is skipped.  It is not part of
//	the skeleton.  Skipping is optional, but
//	Final Fantasy® VII skips it; it is always 0, 0, 0.
// I believe the actual use for the “root” rotation is
//	to dynamically make the model point at its target
//	or face different directions during battle.
// UPDATE: Although the value of this field is 0,0,0 for most animations, some actually store a base rotation here
// so it shouldn't be ignored.
		for ( DWORD I = 0; I < fhHeader.dwBones - 1; I++ ) {
			fbFrameBuffer.psvRots[I+1].fX = (FLOAT)fbFrameBuffer.psvRots[I+1].iX / 4096.0f * 360.0f;
			fbFrameBuffer.psvRots[I+1].fY = (FLOAT)fbFrameBuffer.psvRots[I+1].iY / 4096.0f * 360.0f;
			fbFrameBuffer.psvRots[I+1].fZ = (FLOAT)fbFrameBuffer.psvRots[I+1].iZ / 4096.0f * 360.0f;
		}
// Store the data for this frame here (in your own
//	routine).

	}

	delete [] baData;

Part IV: Qhimm’s Input

Qhimm has taken the time to rewrite two of these functions used in decoding, so it is easier to understand for people who know C++ better than they know assembly (despite my comments being in the assembly code). He has also written a more in-depth look at the logistics behind the rotation compression format and explains its limitations

“GetValueFromStream” is the C/C++ version of my “GetDynamicFrameOffsetBits” and his “GetCompressedDeltaFromStream” is the C++ version of my “GetEncryptedRotationBits”.

short GetValueFromStream( BYTE *pStreamBytes,
DWORD *pdwStreamBitOffset )
{
	// The return value;
    short sValue;
	// The number of whole bytes already consumed in the stream.
	DWORD dwStreamByteOffset = *pdwStreamBitOffset / 8;
	// The number of bits already consumed in the current stream byte.
	DWORD dwCurrentBitsEaten = *pdwStreamBitOffset % 8;
	// The distance from dwNextStreamBytes' LSB to the 'type' bit.
	DWORD dwTypeBitShift = 7 - dwCurrentBitsEaten;
	// A copy of the next two bytes in the stream (from big-endian).
	DWORD dwNextStreamBytes = pStreamBytes[dwStreamByteOffset] << 8 | pStreamBytes[dwStreamByteOffset + 1];

	// Test the first bit (the 'type' bit) to determine the size of the value.
	if (dwNextStreamBytes & (1 << (dwTypeBitShift + 8)))
	{	// Sixteen-bit value:
		// Collect one more byte from the stream.
		dwNextStreamBytes = dwNextStreamBytes << 8 |
			pStreamBytes[dwStreamByteOffset + 2];
		// Shift the delta value into place.
		sValue = (dwNextStreamBytes << (dwCurrentBitsEaten + 1)) >> 8;
		// Update the stream offset.
		*pdwStreamBitOffset += 17;
	}
	else
	{	// Seven-bit value
		// Shift the delta value into place (taking care to preserve the sign).
		sValue = ((short)(dwNextStreamBytes << (dwCurrentBitsEaten + 1))) >> 9;
		// Update the stream offset.
		*pdwStreamBitOffset += 8;
	}

	// Return the value.
	return sValue;
}


short GetCompressedDeltaFromStream( BYTE *pStreamBytes, DWORD *pdwStreamBitOffset,
int nLoweredPrecisionBits )
{
	unsigned int nBits;
	int iFirstBit = GetBitsFromStream( pStreamBytes, pdwStreamBitOffset, 1 );
	if (iFirstBit)
	{
		unsigned int uType = GetBitsFromStream( pStreamBytes,
			pdwStreamBitOffset, 3 ) & 7;
		switch (uType)
		{
		case 0:
			// Return the smallest possible decrement delta (at given
			//	precision).
			return (-1 << nLoweredPrecisionBits);

		case 1: case 2: case 3: case 4: case 5: case 6:
			// Read a corresponding number of bits from the stream.
			int iTemp = GetBitsFromStream( pStreamBytes,
				pdwStreamBitOffset, nBits );
			// Transform the value into the full seven-bit value, using the bit
			//	length
			// as part of the encoding scheme (see notes).
			if (iTemp < 0)	iTemp -= 1 << (nBits - 1);
			else			iTemp += 1 << (nBits - 1);
			// Adapt to the requested precision and return.
			return (iTemp << nLoweredPrecisionBits);
		case 7:
			// Read an uncompressed value from the stream (at requested
			//	precision), and return.
			iTemp = GetBitsFromStream( pStreamBytes,
				pdwStreamBitOffset, 12 - nLoweredPrecisionBits );
			return (iTemp << nLoweredPrecisionBits);
		default:
		}
	}
	// Default/error: return zero.
	return 0;
}


/*
	Notes for rotational delta compression scheme
	=============================================

	The delta values are stored in compressed form in a bit stream. Consecutive
	values share no bit correlation or encoding dependencies, rather they are
	encoded separately using a scheme designed to optimize small-scale rotations.

	Rotations are traditionally given in normalized PSX 4.12 fixed-point compatible
	values, where a full rotation is the integer value 4096. Values above 4095 simply
	map down into the [0,4095] range, as expected from rotational arithmetics. When
	encoded, only the required 12 bits of precision are ever considered.

	In some animations, the author can choose to forcibly lower the precision of
	rotational delta values below 12 bits. Though these animations are naturally not
	as precise, they encode far more efficiently, both because of the smaller size of
	'raw' values, and also because of the increased relative span of the 'close-range'
	encodings available for small deltas. The smallest 128 [-64,63] sizes of deltas
	can be stored in compressed form instead of as raw values. This method is
	efficient
	since a majority of the rotational deltas involved in skeletal character animation
	will be small, and thus doubly effective if used with reduced precision. Precision
	can be reduced by either 2 or 4 bits (down to 10 or 8 bits).

	The encoding scheme is capable of encoding any 12-bit value as follows:

	First, a single bit tells us if the delta is non-zero. If this bit is zero, there
	is no delta value (0) and the decoding is done. Otherwise, a 3 bit integer
	follows, detailing how the delta value is encoded. This has 8 different meanings,
	as follows:

	Type   Meaning
	------ -------------------------------------------------------
	0      The delta is the smallest possible decrement (under the
		current precision)
	1-6    The delta is encoded using this many bytes
	7      The delta is stored in raw form (in the current precision)

	The encoding of small deltas works as follows: The encoded delta can be stored
	using 1-6 bits, giving us a total of 2+4+8+16+32+64 = 126 possible different
	values, which during this explanation will be explained as simple integers (the
	lowest bits of the delta, in current precision). The values 0 (no change) and -1
	(minimal decrement) are already covered, leaving the other 126 values to neatly
	fill out the entire 7 bit range. We do this by encoding each value like follows:

	- The magnitude of the delta is defined as the value of its most significant
	  value bit (in two's complement, so the highest bit not equal to the sign bit).
	  For example, the values '1' and '-2' have magnitude 1, while the value '30' will
	  have a magnitude of 32. For simplicity, we also define the 'signed magnitude' as
	  the magnitude multiplied by the sign of the value (so '-2' has a signed
 	  magnitude of -1).
	- When encoding a value, we subtract its signed magnitude; essentially pushing
	  everything down one notch towards zero, setting the most significant value bit
	  to equal the sign bit and thus ensuring that none of the transformed values
	  require more than six bits to accurately represent in two's complement.
	- The transformed value is then stored starting from its magnitude bit (normally,
	  you would have to start one bit higher to include the sign bit and prevent
 	  signed integer overflow). Small values will be stored using fewer bits, while
 	  larger values use more bits. The two smallest values, 0 and -1, are not
 	  encodeable but are instead handled using the previously mentioned scheme.

	When decoding, you only need to know the number of bits of the encoded value, use
	the value of its most significant bit (not the most significant value bit!) as
	the magnitude, multiply it by the sign of the encoded value to get the signed
	magnitude, and then add that to the encoded value to get the actual delta value.

	Some examples of encodings:

	Delta value *      Encoded            *) in current precision, as integer
	------------------ ------------------
	 0                 0
	-1                 1 000
	-5                 1 011 111
	15                 1 100 0111
	128                1 111 xxxx10000000  (length depends on precision)


	(Note: The reduced precision is treated as rounding towards negative infinity)

*/