Cranberry physics
Cranberry physics intro
Attention Work in progress
I want physics, ability to query the scene for purposes like audio occlusion and of course for game play queries. I took a look at Bullet and Jolt, decided to go with Jolt due to following reasons.
- Open source MIT
- Multithreaded
- Probably good enough for most of the use case I will ever have
- JobSystem integration is supported and simple
- Customizable filters are good enough to support my use cases
Jolt physics engine notes
Integrate Jolt JobSystem
Jolt’s JobSystem is just an interface with few simple rules.
- Jobs are created and destroyed with this interface.
- Each job can be included in only one barrier this makes things so much simpler.
- JobSystem, Barrier, Jobs has to be overridden to support custom behavior.
- Jobs are reference counted and lifetime is managed by that.
The goal here is to integrate the Jolt job system interface to use engine’s global JobSystem. Since our JobSystem supports single threaded mode we need two of job system and other interface implementations. One for full multithreaded mode with at least 1 worker thread and another for single threaded mode where each job on enqueue gets executed immediately.
Before we can start looking at JobSystem itself we have to provide a custom implementation for Barriers.
Barriers
Barriers are just a collection of Jobs that can be waited on for completion.
It has two function AddJob
and OnJobFinished
exposed as interface.
In our case
AddJob
just adds job handle or reference to a thread safe linked list. Adding a Job to barrier must add a reference to reference counter. Destroying barrier will wait for all tasks to complete.OnJobFinished
does not have to do anything as Job enqueueing happens at either Job construction if no dependencies or when number of dependencies drops to 0.
Barrier can use same implementation for both Single and Multithreaded mode. Exception is when waiting.
Wait
function flow
- In single threaded mode, Go through each job in the list
- Execute the ones that can be executed. Executing is not needed as enqueueing must execute the whole tree however it is okay to execute here as well.
- Remove the jobs that are done.
- Do this until there is until there is no more Jobs.
- Assert the list is empty to make sure there is no missing dependencies.
- Clear the rest of the jobs.
- In multithreaded mode the steps are bit different. We can also assume that barrier never gets waited from a worker thread as Jolt never uses barrier as a way to manage dependencies between jobs. This assumption or fact makes dead lock related issues to not happen when worker threads uses different queues. If inter job dependencies are handled using Barrier then we have problem if barrier wait is queued before queueing the dependent job in the same queue. In this case barrier waits for job queued in thread’s same queue which has no chance to dequeue, there by creating dead lock.
- CAS get the linked list head with
nullptr
. This is to prevent further adding of jobs to same linked list. - Count the number of jobs that are already enqueued or waiting for dependencies using some custom data structure.
- Collect those enqueued jobs in another container to be used with
copat::awaitAllTasks
. - Now wait for jobs using
copat::waitOnAwaitable
- Remove the done jobs from the linked list.
- Repeat the steps 2 to 5 until there is no more done jobs cleared from linked list.
- Assert the list is empty to make sure there is no missing dependencies.
- Clear the rest of the jobs.
- CAS get the linked list head with
Job
Each Job must hold some custom data to track coroutine and the job enqueued into the CoPaT JobSystem. Even though the Jolt’s Job has a done status check it cannot alone be relied of if going to destroy a coroutine. As coroutine might still be in progress even after the Jolt job is done.
Instead of overriding the Job I decided to contain the Job inside a JobPacket in cranberry engine.
namespace cbe::physics
{
struct JobPacket
{
JPH::Job joltJob;
/* Inline chain to next entry in Barrier's Job list, Do not need atomic here as head in Barrier will be atomic */
JobPacket *barrierNext;
struct MultiThreadData
{
CopatTask task;
/* For tracking if job is already enqueued, This is necessary since Barrier works outside the coroutine framework of CoPaT JobSystem */
std::atomic_bool bEnqueued;
};
struct SingleThreadData
{
JobPacket *next;
};
/* MultiThreadData task data is not needed for single threading */
union
{
MultiThreadData mt;
SingleThreadData st;
} taskData;
};
}
This way we can easily interop between JobPacket and Job.
JobSystem
There will be two implementation of JobSystem and one selected based on the multithreading constraints in CoPaT’s JobSystem.
In single threaded mode
GetMaxConcurrency
always returns 1CreateJob
will create jobs from a preallocated pool until it gets exhausted.FreeJob
returns the Job to preallocated pool or free the dynamically allocated memory.CreateBarrier
will create from a preallocated pool until it gets exhausted.DestroyBarrier
returns the barrier to preallocated pool or free the dynamically allocated memory.WaitForJobs
will call the single threaded variant of wait in Barrier implementation.QueueJob
andQueueJobs
immediately executes based on reentrant condition. Enqueueing while reentering will add the Job toJobPacket::taskData.st.next
in JobSystem’s queue list.
In multithreaded mode
GetMaxConcurrency
returns number of worker threads incopat::JobSystem
.CreateJob
will reuse already allocated job from a thread safe ring buffer pool. If no free jobs exists new job gets allocated and used.FreeJob
returns the Job to ring buffer. If ring buffer is full frees the allocation.CreateBarrier
Barriers are sparsely created so now it will be pool allocated. New barrier gets allocated from this thread unsafe pool allocator inside a lock.DestroyBarrier
returns the barrier to the thread unsafe pool allocator inside lock.WaitForJobs
will call the multi threaded variant of wait in Barrier implementation.QueueJob
andQueueJobs
Queues the Job for execution wrapped inside a coroutine. Fills theJobPacket::taskData.mt
with task details.
Filter interfaces
The physics engine’s collision or queries system can be controlled to provide customized behavior required for engine using several filter or table interfaces. This is where the collision profiles of engine can be integrated.
Each of the collision or query starts from the BroadPhaseQuery
then to NarrowPhaseQuery
using body and shape. Each body has an ObjectLayer
associated with it which is very important for filtering efficiently.
In order to filter at Broad Phase the BroadPhaseLayer
will be used. Each ObjectLayer must be associated with a BroadPhaseLayer
and keeping number of broad phase layer as low as possible is important. Each broad phase layer has its own unique quad tree of AABBs where all the Bodies with mapped ObjectLayer gets added to.
BroadPhaseLayerInterface and ObjectVsBroadPhaseLayerFilter
This mapping of ObjectLayer to BroadPhaseLayer is provided by BroadPhaseLayerInterface
. For now I have decided to have only three BroadPhaseLayers
- Moveable - For all ObjectLayer that can be moved and cannot be made moveable later. Like static level objects that has no interaction possible other than block collision and world queries.
- NonMoveable - For all ObjectLayer that can be moved even if created as static body and later moved using
SetPosition
. - Triggers - For all ObjectLayer that can be used as triggers or sensors.
The ObjectVsBroadPhaseLayerFilter
interface is what is used to determine if an ObjectLayer can collide with a BroadPhaseLayer. The ObjectLayer comes from body being tested. The BroadPhaseLayer comes from the BroadPhase tree that is being tested in.
ObjectLayerPairFilter
Once BroadPhase filter is successful for each Body’s AABB in tree ObjectLayerPairFilter
is invoke to decide whether to consider the Object layer each body belongs to. This check is done once for all the continuous bodies with same ObjectLayer, keep this in mind when deciding on what data to encode into ObjectLayer.
GroupFilter
GroupFilter is described to be used to filter simulation collisions at the global common level. The groups and subgroups can be assigned to each body to control the collisions. This GroupFilter is a way to simplify for an instance sub body collisions without expensive logics.
Some example use case will be
- Disable collisions between each adjacent elements in chain to keep the stability of the chain.
- Disable inter body smi collision between small moving parts in vehicles.
Right now I do not need GroupFilters for my use case but as soon as I have one I will update here on what and how to approach the case.
BodyFilter
This filter is used for queries and must be passed in with queries. So depending upon what is requested in a query we can issue different filter to support wider use case.
However the most important part I have in my mind right now is if we use a collider for both querying and simulation then we need to do additional look ups here to eliminate sim results. The way we do this checks and elimination depends on how we encode the ObjectLayer date into body and body user data.
ShapeFilter
This filter is used for queries and must be passed in with queries. So depending upon what is requested in a query we can issue different filter to support wider use case. However right now I do not see a point to have this implemented. As shapes alone might not be enough to check advanced filtering.
One use case I can think of is filter out complex shapes like mesh shape to reduce query complexity.
SimShapeFilter
SimShapeFilter
is used for sim to filter out shapes for each body here we can filter bodies based on data from userdata.
Example use cases
- Simple LOD between mesh shape and simple shape.
- Filter out ObjectLayer mask if the body is used for querying too.
ContactListener
ContactListener
is event callback to listen to and reject specific contact points. I do not have any use for this now.
Summary
Here is a short diagram of what I envision the filter interface will do in my engine now.
Engine physics
For optimizing the collision detection as mush as possible I have decided to make each physics body to have separate mask of ObjectLayers
.
However for coarse Object layer filter
these two will be encoded into the ObjectLayer
of each body.
Since each mask must be tested against another body’s Object layer itself we need to store both Object layer and body’s Object layer response.
For this to work we need following conditions
- Change Jolt to use 32bit for ObjectLayer, this can be done using
-DOBJECT_LAYER_BITS=32
cmake config - Limit maximum number of Engine Object Layers to
26(0-25)
. This is to use the higher 26 bits of ObjectLayer as body response mask for both simulation and query combined. - Use
one bit
for determining whether the mask has simulation response in it. Why simulation instead of query? This can be used to quickly reject simulation request if the there the body is never used for simulation. Where as for query if needs additional checks if sim is enabled to be sure if body needs to respond. - Use
5bits
for storing the body’s Object layer itself - ObjectLayer filter will filter layers coarsely
BodyFilter
orSimShapeFilter
filters the based on body’s sim or query response mask based on the filter used.
namespace cbe::physics
{
// same as sizeof(JPH::ObjectLayer) == 32bit
struct ObjectLayer
{
uint32 bodyLayer : 5;
uint32 isSimMask : 1;
uint32 responseMask : 26;
};
}
These are the only special and most important case when it comes to filtering need to be handled by physics engine. Rest of the settings can be created 1:1 with Jolt to keep the interface simple.
Comments