| Age | Commit message (Collapse) | Author | 
|---|
|  | Initially I was going to make 32 vs. 64 be a setting, but decided
now that SDL is supported it's fairly likely there will be odd fb
dimensions (arbitrary window sizes).  Since this never really brought
anything of significant value, just drop the version that mostly
just demonstrated how to pack multiple pixels into a single u64 write
to the framebuffer more than anything else. | 
|  | This removes the submit-softly module, instead using a runtime
setting to toggle bilinear interpolation on the submit module. | 
|  |  | 
|  |  | 
|  | Viscosity and diffusion are supported, it'd be neat to add a
configurable size (the ROOT define) for the flow field in the
future.
I didn't go crazy here, it's just a list of orders of magnitude you
choose from for each.  It'd probably be more interesting to change
this into a single knob with descriptive names like "smoke" "goop"
"water" mapping to a LUT. | 
|  | s/Joe/Jos/, I should wear my glasses more. | 
|  | This implements near verbatim the code found in the paper titled:
Real-Time Fluid Dynamics for Games
By Jos Stam
It sometimes has the filename GDC03.PDF, or Stam_fluids_GDC03.pdf
The density field is rendered using simple linear interpolation of
the samples, in a grayscale palette.  No gamma correction is being
performed.
There are three configurable defines of interest:
VISCOSITY, DIFFUSION, and ROOT.
This module is only threaded in the drawing stage, so basically the
linear interpolation uses multiple cores.  The simulation itself is
not threaded, the implementation from the paper made no such
considerations.
It would be nice to reimplement this in a threaded fashion with a
good generalized API, then move it into libs.  Something where a unit
square can be sampled for interpolated densities would be nice.
Then extend it into 3 dimensions for volumetric effects... | 
|  | Remove the silly kludge avoiding peripheral cells | 
|  |  | 
|  | This substantially reworks the cell sampling in submit.
As a result, it's now threaded in the rendering phase which now
resembles a texture mapper sans transformations.
This produces a full-screen rendering rather than a potentially
smaller one when the resolution wasn't cleanly divisable by the grid
size.
A new module, named submit-softly has also been added to expose the
bilinearly interpolated variant.  The transition between cells is also
employing a smoothstep so it's not actually linear.
The original non-interpolated version is retained as well, at the same
submit module name.
Some minor cleanups happened as well, nothing worth mentioning, except
perhaps that the cells are now a uint8_t which is fine unless someone
tries to redefine NUM_PLAYERS > 255. | 
|  | Just making things consistent, also dropping unnecessary player
assert from submit module.  Future libs/grid may explore using
the "unassigned" zero player in taken calls for unassigning
cells like in simultaneously taken collision scenarios. | 
|  |  | 
|  | This module displays realtime battle for domination simulated
as 2D cellular automata.
This is just a test of the backend piece for a work-in-progress
multiplayer game idea.  The visuals were kind of interesting to
watch so I figured may as well merge it as a module to share.
Enjoy!
PS: the results can vary a lot by tweaking the defines in submit.c | 
|  | Rather than require adding -Isrc/libs/$lib to every Makefile.am for
every lib used, just add -Ilibs to those makefiles and prefix the lib
dir in the #include <> header paths.
Later I'll probably just move the -Isrc/libs someplace common so the
per-module Makefile.am doesn't need to bother with this stuff. | 
|  | This is the first step of breaking out all the core rendering stuffs
into reusable libraries and making modules purely compositional,
consumers of various included rendering/effects libraries.
Expect multiple modules leveraging libray for a variety of scenes and
such.  Also expect compositions mixing the various libraries for more
interesting visuals. | 
|  |  | 
|  | Fixes silly cosmetic error in configure output for checking libdrm... | 
|  | also const the ray_euler_t basis | 
|  |  | 
|  | This moves the per-object _prepared state into ray_render_object_$type
structs with all the rendering-related object methods switched to
operate on the new render structs.
Since the current rendering code just makes all these assumptions
about light objects being point lights, I've just dropped all the
stuff associated with rendering light objects for now.  I think it
will be refactored a bit later on when the rendering code stops
hard-coding the point light stuff.
These changes open up the possibility of constifying the scene and
constituent objects, now that rendering doesn't shove the prepared
state into the embedded _prepared object substructs. | 
|  | This introduces ray_render_t, and ray_render.[ch].
The _prepared member of ray_scene_t has been moved to ray_render_t,
and the other _prepared members (e.g. objects) will follow.
Up until now I've just been sticking the precomputed state under
_prepared members of their associated structures, and simply using
convention to enforce anything resembling an api boundary.  It's
been convenient without being inefficient, but I'd like to move
the ray code into more of a reusable library and this wart needs
to be addressed.
The render state is also where any spatial indexes will be built
and maintained, another thing I've been experimenting with.
Note most of the churn here is just renaming ray_scene.c to
ray_render.c.  A nearly global s/ray_scene/ray_render/ has occurred,
now that ray_scene_t really only serves as glue to bind objects,
lights, and scene-global properties into a cohesive unit. | 
|  | Before I can clean up the ray_scene_t._prepared kludge I need a
place to keep state from frame prepare to render, enter context.
Future commits will migrate the _prepared stuff into a separate
ray_render_t which is constructed on prepare then acted on in
fragment render.
Then spatial acceleration structures may be added, constructed
at prepare phase and shared across the concurrent rendering. | 
|  | Remove some extraneous indentation | 
|  | Commit 445e94 switched to using sentinel objects, but missed removal
of these obsoleted object counts. | 
|  | There's no point computing more reflections if they're not going
to contribute substantially to the resulting sample.  Previously
the max depth threshold solely controlled how many times a given
ray could reflect, this commit introduces a minimum relevance as
well.  Value may require tuning, may actually make sense to move
into the scene description as a parameter.
Brings a minor frame rate improvement. | 
|  | Just cast buf to (void *) for the pointer arithmetic, stride is in
units of bytes and no assumptions should be made about its value
such as divisability by 4. | 
|  | Mechanical cosmetic change | 
|  |  | 
|  | Rather than laying out all fragments in a frame up-front in
ray_module_t.prepare_frame(), return a fragment generator
(rototiller_fragmenter_t) which produces the numbered fragment
as needed.
This removes complexity from the serially-executed
prepare_frame() and allows the individual fragments to be
computed in parallel by the different threads.  It also
eliminates the need for a fragments array in the
rototiller_frame_t, indeed rototiller_frame_t is eliminated
altogether. | 
|  | Trivial optimization eliminates some instructions from the hot path,
no need to maintain a separate index from the current object pointer. | 
|  | Previously every fb_fragment_t (and thus thread) was constructing
its own ray_camera_frame_t view into the scene, duplicating some
work.
Instead introduce ray_camera_fragment_t to encapsulate the truly
per-fragment state and make ray_scene_render_fragment() operate
on just this stuff with a reference to a shared
ray_camera_frame_t prepared once per-frame.
Some minor ray_camera.c cleanups sneak in as well (prefer multiply
instead of divide, whitespace cleanups...) | 
|  | Currently fragments always start at the left edge of the frame, but
when switching to a tiling fragmenter this is no longer true and
causes visible errors. | 
|  | ray:object intersection coordinates were incorrectly being computed
relative to the ray origin using a subtraction instead of addition, a
silly mistake with surprisingly acceptable results.  Those results
were a result of other minor complementary mistakes compensating to
produce reasonable looking results.
In the course of experimenting with an acceleration data structure it
became very apparent that 3d space traversal vectors were not behaving
as intended, leading to review and correction of this code. | 
|  | For now, a simple cpu multiplier of 64 is used.
fb_fragment_t needs a tiling fragment divider added... | 
|  | Small speedup, I personally find the code cleaner this way too.
Everything in the hot path should now be inlined, no function calls. | 
|  | We can just assume the object which reflected the ray being tracing
isn't going to be intersected.  Maybe later this assumption no longer
holds true, but it is true for now. | 
|  | This gets rid of some computation on the primary ray:plane intersection tests
The branches on depth suck though... I'm leaning towards specialized primary
ray intersection test functions. | 
|  | This gets rid of some computation on the primary ray:plane intersection tests | 
|  | To enable prepare to precompute aspects of primary rays which all have a
common origin at the camera, bring the camera to ray_object*_prepare() and
bring the depth to ray_object*_intersects_ray() for primary ray detection.
This is only scaffolding, functionally unchanged. | 
|  | This may need to be undone in the future when more sophisticated lights,
like area lights, are implemented.  For now I can avoid polluting the
objects list with the lights by strictly separating them. | 
|  | Remove unnecessary nearest_object check, the distance comparison alone
is sufficient when initialized to INFINITY. | 
|  | Just tidying up shade_ray() before more optimizations. | 
|  | Trivially removes a ray_3f_mult_scalar() from the hot path. | 
|  | We can avoid some unnecessary work at the max depth by checking it in
shade_ray() instead. | 
|  |  | 
|  | This is functionally identical. | 
|  | This function isn't currently used, but its implementation was awful. | 
|  | Need to normalize the direction when we step the y axis and @ start. | 
|  | powf() is slow but precise, this isn't the fastest method but it's
at least portable and a bit faster. | 
|  | It's only necessary to normalize the direction stored vector in x_step(),
the rest can simply be linearly interpolated which saves some divides. |