Posted by

Posted on

January 15, 2026

Posted under

Comments

Fun With Templates

C++ programming language, just like a regular English language, has concept of “genericity”. For instance, the word “bank“, in English, is generic in the sense that the word can mean an institution (noun) or an activity (verb). The meaning is inferred from the context of the word usage.

Genericity introduces certain degree of flexibility in the language. The flexibility covers etymologically related or even, in some cases, unrelated meanings (or links?). For instance, “bank” may refer to financial institution as a noun or, related activity as a verb, or edge of a river as an unrelated (to the institution) noun. Basically, genericity can be viewed as minimizing the vocabulary and maximizing the utility or means to control the complexity of a language.

Similarly, in C++ language, there is a notion of generic programming which is implemented by using templates. Consider the following C++ code

			
template<typename T>
void Exchange(T& a, T& b)
{
    T temp = a;
    a = b;
    b = temp;
}

		

This function, Exchange, can be used for floats or ints alike to exchange the value like so

			
int a = 10;
int b = 20;
float c = 100;
float d = 200;
Exchange(a, b); // a becomes 20 and b becomes 10
Exchange(c, d); // c becomes 200 and d becomes 100

		

Note how the function Exchange is used for both ints and floats type. Based on the context, int and float calls, compiler generates two copies of Exchange function, one each for int and float type respectively, using which the values of variables are exchanged.

Posted by

ravimohan

Posted on

March 6, 2025

Posted under

The holographic view

Comments

Leave a comment

Blood Splat

Recently, whilst making an FPS game, I happen to implement a feature involving blood. The idea is, when you butcher zombies, their post-coagulated(?) blood spills on the floor they are standing or rather creeping on. Clearly we need to use something “other than” UParticleSystem for the purpose because we are not looking those particle effects which look like spray in air

The gif shows two kinds of blood effects. One is the blood-sprayed-in-air effect (particle effect) and other is splat on ground (decal).

For particle system we have the following Unreal C++ declaration

UPROPERTY(BlueprintReadWrite, EditAnywhere, Category = "Effects")
TArray<UParticleSystem*> BloodEffects;

and following code implementation

UParticleSystemComponent* PSC = NewObject<UParticleSystemComponent>(this, UParticleSystemComponent::StaticClass());
...			
PSC->SetWorldLocationAndRotation(LastTakeHitInfo.RelHitLocation + GetActorLocation(), LastTakeHitInfo.RelHitLocation.Rotation());
...

Here first we create new object of type UParticleSystemComponent and set to raycast’s hit location on the pawn. Hence spray effect can be seen.

What about the blot spats on the floor?

For that, we take help of decals.

UPROPERTY(BlueprintReadWrite, EditAnywhere, Category = "Effects")
TArray<FBloodDecalInfo> BloodDecals;

Basically, we send line trace from hit location (on the pawn) towards vertically downward direction and obtain the hit location on the floor

GetWorld()->LineTraceSingleByChannel(Hit, TraceStart, TraceStart + FVector(0.f, 0.f, -1.f) * 200, ECC_Visibility, FCollisionQueryParams(NAME_BloodDecal, false, this)))

and use Engine’s SpawnDecalAtLocation

UDecalComponent* DecalComp = UGameplayStatics::SpawnDecalAtLocation(GetWorld(), DecalInfo.Material, FVector(1.0f, DecalScale.X, DecalScale.Y), Hit.Location, (-Hit.Normal).Rotation(), 0);

to spawn blood decal at Hit.Location.

Posted by

ravimohan

Posted on

June 14, 2023

Posted under

The holographic view

Comments

Leave a comment

C++ Typecasts – An Assembly POV

This post may be considered a continuation of the post Addressing The Addresses or a separate read. We are basically interested in understanding how type-casts are done at assembly level, corresponding to C++ language, which may give a clue about the address arithmetic being performed leading to the offsets discussed in the post I mentioned in the beginning.

First let me demonstrate the typecasting of primitive types (I am taking some stuff from this wiki page). Consider the C++ code

int aVar = 65;
int* intPointer = &aVar;
    
char* charPointer = (char*) intPointer;

In the last line we are doing an explicit type conversion from int to char.

From assembly instruction perspective, there is no difference between int and char as far as the casting is concerned. Information about one pointer (of type int) is shared with the different pointer (of type char). Only the dereferencing bit differs like so

char b = *aP;         mov     rax, QWORD PTR [rbp-8]
                      movzx   eax, BYTE PTR [rax]
                      mov     BYTE PTR [rbp-9], al

while for int

int d = *cP;          mov     rax, QWORD PTR [rbp-24]
                      mov     eax, DWORD PTR [rax]
                      mov     DWORD PTR [rbp-28], eax

for char type of dereferencing, BYTE is read, while, for int type of dereferencing, DWORD is read.

Moving on, we now consider user defined data types, specially classes. Consider the following code

#include <iostream>
class UObjectBase
{
    int a;
    int b;
};

class UObject : public UObjectBase
{
public:
    virtual ~UObject() = default;
};

// Type your code here, or load an example.
int main()
{
    UObjectBase bar;
    UObjectBase* ptr = &bar;

    UObject* fp = (UObject*) ptr;

    std::cout << &bar << '\n';
    std::cout << fp << '\n';
}

The out put of above program may be

0x7ffe4dd2f998
0x7ffe4dd2f990 // 8 bytes offset

however on removing the virtual function (the destructor ~UObject()) may lead to the out put

0x7ffe4dd2f998
0x7ffe4dd2f998 // same address

Here, we are basically down casting from UObjectBase to UObject and that leads to the offset of 8 bytes in presence of virtual function (vtable). The assembly code instructions (generated vis godbolt) look like

    UObject* fp = (UObject*) ptr;    cmp     QWORD PTR [rbp-8], 0
                                     mov     eax, 0
                                     mov     rax, QWORD PTR [rbp-8]
                                     sub     rax, 8
                                     jmp     .L3

where .L3 is some complex set of instructions, while, in absence of the virtual function, instructions are like so

    UObject* fp = (UObject*) ptr;    mov     rax, QWORD PTR [rbp-8]
                                     mov     QWORD PTR [rbp-16], rax

In presence of the virtual function (or vtable) there is an instruction to subtract 8 bits from rax which creates the offset in addresses shown in the out put. This example was taken from stackoverflow post.

In the book “Effective C++, third edition”, item 27, Scott Myers points that

… a single object (e.g., an object of
type Derived) might have more than one address (e.g., its address
when pointed to by a Base* pointer and its address when pointed to by
a Derived* pointer). That can’t happen in C. It can’t happen in Java. It
can’t happen in C#. It does happen in C++.

Posted by

ravimohan

Posted on

June 1, 2023

Posted under

The holographic view

Comments

1 Comment

Addressing The Addresses

In my books, computers are the best machines humans have ever built. A big wave to Alan Turing, you are the thinker I may resent if at all (well Paul Dirac and Darwin are in the same league). All Turing machines (i.e computers) have tape to read from and to write to, like so

The purpose of this blog-post is to understand the implications of “how” the tape is read. Based on this notion of “how” variety of “meanings” (software data-types) emerge if considered at low enough level of abstraction (assembly language).

We will take a live example from Karma, where we have a UObject pool allocator. The idea is to allocate a block of memory for UObject, for instance, AActor spawn (which is done in the routine SpawnActor()). Another example is UClass (which is great grand child class of UObject). Our pool allocator, GUObjectAllocator, allocates the space from the blue strip, representing Karma’s pre-allocated memory, and returns the address in red, meaning the subsequent block of memory addresses, determined by the size of UClass, have been reserved.

However, the following error message (pardon the Normal block address which is not specific to the example I have in mind, written above) started appearing on application close (or, to be precise, on freeing the blue block of memory at application close)

Thus began my search for the grand resolution of the issue. I posted the error message in CppIndia Discord Server. In the response, I was suggested to use Address Sanitizer (ASAN) to look out for the cause of such issue. And sure enough, Xcode ASAN greeted me with the following

==36231==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x00012ad047f8 at pc 0x00010084d208 bp 0x00016fdfcc20 sp 0x00016fdfc3e0
WRITE of size 64 at 0x00012ad047f8 thread T0
    #0 0x10084d204 in __asan_memset+0x104 (libclang_rt.asan_osx_dynamic.dylib:arm64e+0x41204) (BuildId: f0a7ac5c49bc3abc851181b6f92b308a32000000200000000100000000000b00)
    #1 0x102a971d4 in Karma::FGenericPlatformMemory::Memzero(void*, unsigned long) GenericPlatformMemory.h:72
    #2 Callstack .....

0x00012ad047f8 is located 8 bytes to the left of 1280000-byte region [0x00012ad04800,0x00012ae3d000)
allocated by thread T0 here:
    #0 0x10084ee68 in wrap_malloc+0x94 (libclang_rt.asan_osx_dynamic.dylib:arm64e+0x42e68) (BuildId: f0a7ac5c49bc3abc851181b6f92b308a32000000200000000100000000000b00)
    #1 0x102aa5b08 in Karma::FMemory::SystemMalloc(unsigned long) KarmaMemory.h:124
    #2 0x102aa5a24 in Karma::KarmaSmriti::StartUp() KarmaSmriti.cpp:25
    #3 0x1029ab45c in Karma::Application::PrepareMemorySoftBed() Application.cpp:65
    #4 0x1029aade8 in Karma::Application::PrepareApplicationForRun() Application.cpp:54
    #5 0x10000772c in main EntryPoint.h:79
    #6 0x195923f24  (<unknown module>)

Some report on Heap Overflow
==36231==ABORTING

I have put bold font to the text of interest. The FGenericPlatformMemory::MemZero() is doing the “illegal” writing to the block of memory at address 0x00012ad047f8 which was not the address, 0x00012ad04800, returned by GUobjectAllocator. Furthermore this fact is reinforced by the message “0x00012ad047f8 is located 8 bytes to the left of 1280000-byte region [0x00012ad04800,0x00012ae3d000)”. So who or rather how is this offset of 8 bytes is being introduced?

The typecasting done while generating the UClass object, here

ReturnClass = (UClass*)GUObjectAllocator.AllocateUObject(sizeof(UClass), alignof(UClass), true);

is the reason for the offset of 8 bytes leading to the error message because of writing in the place that is not supposed to be written by the app. I have marked the “illegal” block of memory with pink in the cartoon above. The rectification is simple enough

ReturnClass = reinterpret_cast<UClass*>(GUObjectAllocator.AllocateUObject(sizeof(UClass), alignof(UClass), true));

The reinterpret_cast basically type casts the data type without introducing offsets in the address. Thus the conversion from UObjectBase* to UClass* is achieved with ReturnClass having the address value of 0x00012ad04800, which is the legal block of memory reserved by Karma’s pool allocator.

This might raise a question on the comparative working of reinterpret_cast and C-style cast that we leave to the future. A thing that can be said is for that we will be needing assembly language equivalent of the code, something along the lines of this article.

Posted by

ravimohan

Posted on

August 12, 2022

Posted under

The holographic view

Comments

Leave a comment

Towards Cross-Platform Based Mindset (I)

I believe a good game-dev environment should be capable of supporting all-platform philosophy and this can be achieved by being open enough to strive for enough learning and compassion.

The purpose of this blog-post series is to explore and show just that. We begin with the study of a cross-platform real-time graphics driver.

Vulkan is a cross-platform 3D graphics API and graphics driver meant for Real Time rendering. The best feature, for me personally, is the degree of control provided with the fortune of low-level programming tools which clears up the graphics rendering pipeline concept for enthusiasts and delivers major lesson for the uninitiated (who survive at the convenience of easy GL type function calls, not to say I’m not Glad, pun intended). Our friend Cherno is a great propagator of the API who introduced me to Vulkan. But we gonna extend his legacy and put the concept of cross-platform to test with all the stress-tests known to tech-kind.

Along with the official tutorial I have consulted Travis Vroman’s videos and some off-mainstream videos. In context of Karma, we don’t need to hook Vulkan as sole beneficiary but rather as any other regular library with enough fluidity of switching like so

#include "RendererAPI.h"

namespace Karma
{
    RendererAPI::API RendererAPI::s_API = RendererAPI::API::OpenGL;
}

Here we see it is matter of single line to change the rendering API for Karma. Clearly I plan to implement Metal/DirectX in future. Remember we need a decoupled way of hooking the library for cross-platform reasons. This results in the following folder hierarchy generating enough basis for the abstraction we are striving for

KarmaEngine  
│
└───Karma
│   │
│   └───src
│   │   │
│   │   └───Karma
│   │   │   │
│   │   │   └───Animations
│   │   │   └───Events
│   │   │   └───Renderer
│   │   │   Input.h
│   │   │   Core.h
│   │   │   Log.h
│   │   │   ...
│   │   │
│   │   └───Platform
│   │       │
│   │       └───Linux
│   │       └───Mac
│   │       └───OpenGL
│   │       └───VUlkan
│   │       └───Windows
│   │        
│   │   Karma.h
│   │   ...
│   │
│   └───vendor 
│   │   │
│   │   └───assimp
│   │   └───Glad
│   │   └───GLFW
│   │   └───GLM
│   │   └───glslang
│   │   └───ImGui
│   │   └───spdlog
│   │   └───stb 
│   │   ...
│
└───Application
    │   
    └───src
    │   KarmaApp.cpp
    │   ...

The entities thus formed are equipped with the ability to hook with high specificity and moral values (minding own business and not interfering with others in any sense). Then it becomes my job to deal with the decisions, for instance, how much ground to cover with how much depth such as to maintain the overall harmony in Karma. At this point in time, the following flow-chart captures the essence of Karma

Clearly, Vulkan is part of Rendering Engine, which, is an abstraction that can be visualized as a magical (not mystic though) box with the following diagram

The layer takes care of what platform we are running on and which rendering API we are rocking with. Then the concept of cross matching (within the scope of API-Platform support alliances) is obvious. Let me outline of the Rendering Engine (RE) functioning. In a typical Rendering loop, the following lines of code are a must

virtual void OnUpdate(float deltaTime) override
{
    Karma::RenderCommand::SetClearColor(/* Some Color */);

        Karma::RenderCommand::Clear();

    Karma::Renderer::BeginScene();

    /* Process materials and Bind Vertex Array. */

    Karma::Renderer::Submit(/* Some Vertex Array */);

    Karma::Renderer::EndScene();
}

Now we would want our APIs (Vulkan/OpenGL or whatever) to conform to these simple static function calls. That is where the RE magic comes in. No matter what the working philosophy of graphics API is, we need to channel that into a working set of routines such that the above OnUpdate function, called in the game loop, which can be identified with Karma's Rendering Handle in the flow chart, makes sense (works). Let me define and explain the actors of this play

Vertex Array: Can be imagined synonymous to a well behaved container of Mesh and Material (defined here). Each API should have its corresponding instantiation of Vertex Array.
Render Command: Is a static class with the rendering structure (curtsey TheCherno) and with the runtime-polymorphism responsibility to direct the calls to appropriate API (specified here). It holds the reference to RendererAPI which could be Vulkan or OpenGL after doing their initialisation specified in RendererAPI.cpp.
Renderer: A data-oriented static class containing the scene information (for instance camera) for relevant processing, and, rendering API information for Karma.

There is an entire branch in which we attempt to hook Vulkan API and which succeeded almost as expected (with some reservations and I plan to do a stress test with large number (millions perhaps billions of triangles)). But let me first chalk out the partial-abstractions acting like an interface between the static routines, in the above block, and low level Vulkan commands. The following are the Vulkan adaptations of Karma’s abstract classes.

Vulkan Shader (derived from Shader)
Vulkan Buffer (derived from Buffer)
Vulkan Context (derived from Graphics Context)
Vulkan RendererAPI (derived from RendererAPI)
Vulkan VertexArray (derived from VertexArray)

Please note I am using the video as raw material and vulkan-tutorial for regular checks in what I understand and write henceforth.

The goal for any decent graphics API is to provide means (tangible and intangible) to map the coordinates of a triangle (read my blog-post for underlining importance of triangles in Real Time rendering) to the pixels on monitor or screen. This is it! What basically creates the market for various APIs IS the need for a hardware friendly algorithm which is considerate enough not only for scaling to billion triangles but also for providing natural fit to the taste of the programmer. Taste may include various levels of legitimate control based on what the programmer may think serves the purpose.

To be continued…

Ravi Mohan

If you haven't found something strange during a day, it hasn't been much of a day. – J. A. Wheeler

Category Archives: The holographic view

Fun With Templates

Blood Splat

C++ Typecasts – An Assembly POV

Addressing The Addresses

Towards Cross-Platform Based Mindset (I)