rototiller/.git - Collection of software-rendered graphics hacks supporting libdrm and SDL2

Age	Commit message (Collapse)	Author
2017-08-07	ray: more fragments for better thread utilization	Vito Caputo
	For now, a simple cpu multiplier of 64 is used. fb_fragment_t needs a tiling fragment divider added...
2017-08-07	threads: rework threaded fragment scheduling	Vito Caputo
	Instead of creating fragment lists striped across available threads uniformly in a round-robin fashion, just have the render threads iterate across the shared fragments array using atomics. This way non-uniform cost of rendering can be adapted to, provided the module prepares the frame with sufficient fragment granularity. In the ray tracer for example, it is quite common for some areas of the screen to have lower complexity/cost than others. The previous model distributed the fragments uniformly across the threads with no ability for underutilized threads to steal work from overutilized threads in the event of non-uniform cost distributions. Now no attempt to schedule work is made. The render threads simply race with eachother on a per-frame basis, atomically incrementing a shared index into the frame's prepared fragemnts. The fragment size itself represents the atomic work unit. A later commit will change the various renderers to prepare more/smaller fragments where appropriate. The ray tracer in particular needs more and would probably further benefit from a tiling strategy, especially when an acceleration data structure is introduced.
2017-06-03	ray: convert from recursive to iterative tracing	Vito Caputo
	Small speedup, I personally find the code cleaner this way too. Everything in the hot path should now be inlined, no function calls.
2017-06-02	ray: skip intersection tests on reflector objects	Vito Caputo
	We can just assume the object which reflected the ray being tracing isn't going to be intersected. Maybe later this assumption no longer holds true, but it is true for now.
2017-06-02	ray: precompute primary ray for ray_object_sphere_t	Vito Caputo
	This gets rid of some computation on the primary ray:plane intersection tests The branches on depth suck though... I'm leaning towards specialized primary ray intersection test functions.
2017-06-02	ray: precompute primary ray for ray_object_plane_t	Vito Caputo
	This gets rid of some computation on the primary ray:plane intersection tests
2017-06-02	ray: plumb depth and camera to objects	Vito Caputo
	To enable prepare to precompute aspects of primary rays which all have a common origin at the camera, bring the camera to ray_object_prepare() and bring the depth to ray_object_intersects_ray() for primary ray detection. This is only scaffolding, functionally unchanged.
2017-06-02	ray: separate lights from objects	Vito Caputo
	This may need to be undone in the future when more sophisticated lights, like area lights, are implemented. For now I can avoid polluting the objects list with the lights by strictly separating them.
2017-06-02	ray: simplify trace_ray inner loop slightly	Vito Caputo
	Remove unnecessary nearest_object check, the distance comparison alone is sufficient when initialized to INFINITY.
2017-06-01	ray: move shadow check to a function	Vito Caputo
	Just tidying up shade_ray() before more optimizations.
2017-06-01	ray: perform ambient light color scale in prepare	Vito Caputo
	Trivially removes a ray_3f_mult_scalar() from the hot path.
2017-06-01	ray: move max depth check out of trace_ray()	Vito Caputo
	We can avoid some unnecessary work at the max depth by checking it in shade_ray() instead.
2017-05-27	ray: inline ray_object_* switch functions	Vito Caputo

2017-05-27	ray: simplify ray_3f_normalize()	Vito Caputo
	This is functionally identical.
2017-05-27	ray: redo ray_3f_distance()	Vito Caputo
	This function isn't currently used, but its implementation was awful.
2017-05-27	ray: normalize direction missed in 28d8022	Vito Caputo
	Need to normalize the direction when we step the y axis and @ start.
2017-05-27	ray: use approximate power in specular reflection	Vito Caputo
	powf() is slow but precise, this isn't the fastest method but it's at least portable and a bit faster.
2017-05-26	ray: s/nlerp/lerp/ where normalize is unnecessary	Vito Caputo
	It's only necessary to normalize the direction stored vector in x_step(), the rest can simply be linearly interpolated which saves some divides.
2017-05-12	ray: mult normalize in ray_object_sphere_normal	Vito Caputo
	Simple optimization taking advantage of the prepare, mults generally are cheaper than divs.
2017-05-12	ray: add ray_scene_prepare() object precomputing	Vito Caputo
	Just embed a _prepared struct in the object where precomputed stuff can be cached. Gets called once before rendering, which ends up calling the object-specific ray_object_$type_prepare() methods per object.
2017-04-27	*: remove vestigial module/${module}/${module}.h	Vito Caputo
	Prior to rototiller_module_t these headers were included and the module-specific render functions called directly. That's no longer the case, these files are irrelevant today.
2017-04-27	sparkler: enable rudimentary threaded rendering	Vito Caputo
	This moves most of the particle system maintenance into the serially executed sparkler_prepare_frame(), divides the frame into ncpus fragments, and leaves the draw to occur concurrently. The drawing must still currently process all particles and simply skips drawing those falling outside the fragment. Moving more of the computation out of prepare_frame() and into render_fragment() is left for future improvements, as it's a bit complex to do gainfully.
2017-04-27	sparkler: respect fragment->frame_{width,height}	Vito Caputo
	should_draw_expire_if_oob() assumed the fragment represented the entire frame. Instead, return 0 if the coordinates are outside the fragment, but only reset longevity if outside of the frame. If sparkler goes threaded in the drawing, this would result in threads simply skipping particles outside the fragment. The longevity reset occurring in all threads appears suspicious but should be benign since they all write the same thing - 0.
2017-04-27	fb: apply fragment coordinates put pixel helper	Vito Caputo
	fb_fragment_put_pixel_unchecked() assumed the x,y coordinates of the fragment were always 0.
2017-04-26	sparkler: utilize context struct for module state	Vito Caputo

2017-04-26	sparkler: add virtual flag to particle_props_t	Vito Caputo
	The burst particle abused a zero mass to circumvent the effects of aging. Instead use an explicit virtual flag to suppress aging, change busrt_cb to ignore all virtual particles instead of only its center. Previously burst_cb would thrust other bursts like any other particle, and this was incorrect. Now burst centers are always stationary, even when they overlap other bursts.
2017-04-22	roto: utilize context struct for module state	Vito Caputo

2017-04-22	plasma: utilize context struct for module state	Vito Caputo

2017-04-22	julia: utilize context struct for module state	Vito Caputo

2017-04-22	*: add module context machinery	Vito Caputo
	introduces create_context() and destroy_context() methods, and adds a 'void *context' first parameter to the module methods. If a module doesn't supply create_context() then NULL is simply passed around as the context, so trivial modules can continue to only implement render_fragment(). A subsequent commit will update the modules to encapsulate their global state in module-specific contexts.
2017-04-22	ray: remove vestigial ray_threads code	Vito Caputo
	Now that rototiller is generally threaded when a prepare_frame() method is supplied, and modules/ray has been updated accordingly, discard the now redundant ray-specific threading code and related stuff.
2017-04-22	ray: convert to generalized threaded rendering	Vito Caputo
	The ray tracer was already threaded, so this required little change other than making some state global like the previous commits, and calling the underlying non-threaded single-fragment scene renderer function. A future commit will discard the now vestigial ray_threads related code.
2017-04-22	roto: enable threaded rendering	Vito Caputo
	Same basic changes as the previous commits made to julia and plasma.
2017-04-22	plasma: enable threaded rendering	Vito Caputo
	Same general procedure as the previous commit made to the julia module.
2017-04-22	julia: enable threaded rendering	Vito Caputo
	Move maintenance of per-frame variables into julia_prepare_frame(), which requires making them static file-scope globals for now. Also make minor adjustments to the code to make less assumptions about the fragment being rendered (like it's x/y coordinates being 0, etc.) A future commit will probably add an initializer function to rototiller_module_t, with an opaque pointer output which will be fed to all the module methods so these globals can be encapsulated and instantiated.
2017-04-22	rototiller: add threaded rendering	Vito Caputo
	This is a simple worker thread implementation derived from the ray_threads code in the ray module. The ray_threads code should be discarded in a future commit now that rototiller can render fragments using threads. If a module supplies a prepare_frame() method, then it is called per-frame to prepare a rototiller_frame_t which specifies how to divvy up the page into fragments. Those fragments are then dispatched to a thread per CPU which call the module's rendering function in parallel. There is no coupling of the number of fragments in a frame to the number of threads/CPUs. Some modules may benefit from the locality of tile-based rendering, so the fragments are simply dispatched across the available CPUs in a striped fashion. Helpers will be added later to the fb interface for tiling fragments, which modules desiring tiled rendering may utilize in their prepare_frame() methods. This commit does not modify any modules to become threaded, it only adds the scaffolding.
2017-04-22	rototiller: add prepare_frame & rototiller_frame_t	Vito Caputo
	Undecided on wether rototiller_frame_t should be fb_frame_t or not, may change later.
2017-04-22	fb: add frame_{width,height} to fb_fragment_t	Vito Caputo
	Modules need to know the overall dimensions of the frame the fragment they're rendering is part of. Previously it wasn't really necessary since the fragments supplied to the modules had always been the full page, but that's changing. This commit also changes the julia module to use the frame dimensions, others will need updating as well.
2017-04-22	*: /render/render_fragment/ in rototiller_module_t	Vito Caputo
	Adding more context to the name in anticipation of adding a prepare_frame() method to the module struct.
2017-04-21	*: s/renderer/module/g	Vito Caputo
	Make consistent with the source directory structure naming.
2017-04-21	sparkler: remove vestigial draw.h from Makefile.am	Vito Caputo

2017-02-14	ray: increase highlight exponent on shiny sphere	Vito Caputo
	The highlight on the little green sphere was white-washing the entire thing due to its high specular reflection value. This produces more reasonable results...
2017-02-14	ray: add highlight exponent to ray_surface_t	Vito Caputo
	Was a constant at 20, this allows it to be specified per-object.
2017-02-12	ray: tweak surface properties	Vito Caputo
	Make spheres a little more diverse in terms of specular/diffusion, and minor tweak to the plane color.
2017-02-12	ray: update copyright years	Vito Caputo

2017-02-12	ray: improve camera movement	Vito Caputo
	Make the camera orbit around the origin at a varying radius, with kept aimed facing the origin, with some vertical sweep+tilt thrown in.
2017-02-12	julia: add a morphing Julia set renderer	Vito Caputo
	This is unoptimized, with a palette slapped together in vim, but still pretty neat!
2017-02-12	ray: fixup ray_object_plane_intersects_ray() bug	Vito Caputo
	We should only consider dot products > 0 as intersected, or >= something very close to 0 (epsilon). As-is resulted in planes moving with camera movement along the plane normal axis. Also fixes plane distance to be non-negative in the current scene.
2017-02-12	ray: remove vestigial stdio.h include	Vito Caputo
	Leftover from debugging presumably
2017-02-10	ray: implement all orders in ray_euler_basis()	Vito Caputo
	Originally I only implemented pitch->yaw->roll, and being new to all this didn't fully appreciate the limitation that resulted in. This adds all six permutations of pitch/yaw/roll, the scene must specify the desired order when setting up the camera with the euler angles, see the enum in ray_euler.h.