Logical vs Memory Structure Arrangement

In a previous post I talked about how avoiding automatic structure padding can be beneficial for performance, because of the importance of cache locality in modern CPU architectures.

Today I’ll talk about why I wish C or C++ would allow us to specify a logical arrangement for struct members in order to increase code readability.

Data structure alignment

If you specify a structure or class in C/C++, the compiler will arrange its members in memory in the same order as you specified them in the struct/class definition (ignoring for the sake of simplicity virtual table pointers added in C++).

So, for example, the following struct will have the member X come before Y in memory.

struct MyData
{
  char X; // 1 byte
  int Y;  // 4 bytes
};

In this case, the compiler will also insert 3 bytes of padding after X in order to keep Y aligned to a word boundary.

When default arrangement sucks

Default structure arrangement can be problematic at times, especially when dealing with large structures.

For example, it is often much more readable to group fields together logically, but this can lead to memory memory wastage caused by padding, like in the following simple example:

struct ReadableButSuboptimal
{
  entity* Entities;
  u16 EntityCount;

  mesh* Meshes;
  u8 MeshCount;

  shader* Shaders;
  u8 ShaderCount;

  u32 Flags;
};

This is a perfectly reasonable arrangement for a human: each pointer is followed by the count for the array.

Unfortunately, it also wastes memory, because of the bytes of padding introduced by the compiler. An optimal, but less readable arrangement would be the following:

struct LessReadableButOptimal
{
  entity* Entities;
  mesh* Meshes;
  shader* Shaders;

  u16 EntityCount;
  u8 MeshCount;
  u8 ShaderCount;

  u32 Flags;
};

This arrangement wastes no bytes in padding, but is, in my opinion, less readable than the previous.

Logical arrangement as an option

A better approach for cases like this where the order of members is not important would be to specify structure members logically, for example by decorating the structure like so:

struct InAnIdealWorld
{
  entity* Entities;
  u16 EntityCount;

  mesh* Meshes;
  u8 MeshCount;

  shader* Shaders;
  u8 ShaderCount;

  u32 Flags;
} __attribute__((rearrange)); // NOTE: Made up attribute, doesn't actually exist

This would tell the compiler: “Hey, I don’t really care about the order in which the members of this struct are laid out, rearrange them at will”.

Obviously this shouldn’t be the default behavior as it would cause all sorts of bugs when dealing with structures that have to cross API boundaries between libraries, but it would be very useful for internal subsystems…

I think ;)

Metric Panda Games

One pixel at a time.

Logical vs Memory Structure Arrangement

Data structure alignment

When default arrangement sucks

Logical arrangement as an option