rototiller/.git - Collection of software-rendered graphics hacks supporting libdrm and SDL2

Age	Commit message (Collapse)	Author
2018-02-20	rototiller: rudimentary argv parsing scaffolding	Vito Caputo
	Nothing wired up yet.
2018-02-20	settings: introduce abstract settings	Vito Caputo
	Settings will be used to express configurable parameters in the rendering modules and fb backends. The goal is to address both commandline argument setting of parameters, automatic use of defaults, as well as interactive configuration including the outputting of the resulting settings in a form usable as a commandline for future reuse. Since settings can be numerous and highly varied from one module or backend to another, a form similar to the Linux kernel's cmdline or QEMU's approach has been adopted. For example, a complete DRM backend, card selection and config would be: rototiller --video=drm,dev=/dev/dri/card0,connector=LVDS-1,mode=1024x768@60 If any of the above were omitted, then the missing settings would be interactively configured. If you added --defaults, then any omissions would be automatically filled in with the defaults. i.e. rototiller --video=drm,dev=/dev/dri/card4 --defaults would use the preferred connector and mode for that card. rototiller --video=drm --defaults would do the same but also default to the /dev/dri/card0 path. for brevity, I omitted rendering modules from above, but the same approach applies simply to --module=: rototiller --module=sparkler --video=drm --defaults If you ran rototiller without any arguments, then a fully interactive setup would ensue for module and video. If you ran rototiller with just --defaults, then everything is defaulted for you. A default rendering module will be used (the original roto renderer, probably). Note that this commit only adds scaffolding to make this possible, none of this is wired up yet.
2018-01-01	fb: switch over to fb_ops_t abstraction	Vito Caputo
	Remove everything drm-related from fb.c, utilizing the implementation in drm_fb.c instead.
2018-01-01	drm_fb: implement drm fb backend	Vito Caputo
	Largely mechanical copying of the drm code into the new fb_ops_t abstraction. Dormant for now.
2018-01-01	fb: introduce fb_ops_t	Vito Caputo
	Hooks for fb acquire/release, page allocate, free, and flip. This should encompass everything currently needed for the drm backend, which will move behind this abstraction in a later commit.
2017-12-31	fb: combine page flip with wait	Vito Caputo
	Tidying this up a bit in preparation of ripping out all drm-specific stuff from fb.[ch]. Future commits will refactor fb.c to utilize an fb_ops_t for hooks to allocate, flip, and free pages.
2017-12-23	ray: constify input scene and camera parameters	Vito Caputo
	also const the ray_euler_t basis
2017-12-23	ray: constify all ray_3f_t method parameters	Vito Caputo

2017-12-23	ray: split object render from object description	Vito Caputo
	This moves the per-object _prepared state into ray_render_object_$type structs with all the rendering-related object methods switched to operate on the new render structs. Since the current rendering code just makes all these assumptions about light objects being point lights, I've just dropped all the stuff associated with rendering light objects for now. I think it will be refactored a bit later on when the rendering code stops hard-coding the point light stuff. These changes open up the possibility of constifying the scene and constituent objects, now that rendering doesn't shove the prepared state into the embedded _prepared object substructs.
2017-12-10	ray: split scene data from render state	Vito Caputo
	This introduces ray_render_t, and ray_render.[ch]. The _prepared member of ray_scene_t has been moved to ray_render_t, and the other _prepared members (e.g. objects) will follow. Up until now I've just been sticking the precomputed state under _prepared members of their associated structures, and simply using convention to enforce anything resembling an api boundary. It's been convenient without being inefficient, but I'd like to move the ray code into more of a reusable library and this wart needs to be addressed. The render state is also where any spatial indexes will be built and maintained, another thing I've been experimenting with. Note most of the churn here is just renaming ray_scene.c to ray_render.c. A nearly global s/ray_scene/ray_render/ has occurred, now that ray_scene_t really only serves as glue to bind objects, lights, and scene-global properties into a cohesive unit.
2017-12-10	rototiller: introduce module.finish_frame()	Vito Caputo
	Add a hook for post-render serialized frame completion, some of the renderers may have state to cleanup after rendering a frame. A future commit may change add a return value to control flow for features like multi-pass rendering within a given module. The raytracer for example may want to add concurrently executed post filters, and having a non-void return from finish_frame() would be a tidy way to tell rototiller "go back to prepare->render for this context" as many times as necessary, keeping the pass state in the context. For now its return is void however, as I just need a cleanup hook as the raytracer becomes more stateful per frame with a BIH spatial index in the works.
2017-12-10	ray: add module context ray_context_t	Vito Caputo
	Before I can clean up the ray_scene_t._prepared kludge I need a place to keep state from frame prepare to render, enter context. Future commits will migrate the _prepared stuff into a separate ray_render_t which is constructed on prepare then acted on in fragment render. Then spatial acceleration structures may be added, constructed at prepare phase and shared across the concurrent rendering.
2017-12-10	ray: trivial formatting changes	Vito Caputo
	Remove some extraneous indentation
2017-09-29	ray: remove unused ray_scene_t.n_{lights,objects}	Vito Caputo
	Commit 445e94 switched to using sentinel objects, but missed removal of these obsoleted object counts.
2017-09-17	ray: stop recurring below a relevance threshold	Vito Caputo
	There's no point computing more reflections if they're not going to contribute substantially to the resulting sample. Previously the max depth threshold solely controlled how many times a given ray could reflect, this commit introduces a minimum relevance as well. Value may require tuning, may actually make sense to move into the scene description as a parameter. Brings a minor frame rate improvement.
2017-09-15	modules/*: cease dividing stride by 4	Vito Caputo
	Just cast buf to (void *) for the pointer arithmetic, stride is in units of bytes and no assumptions should be made about its value such as divisability by 4.
2017-09-14	fb: update copyright line	Vito Caputo

2017-09-14	fb: s/fb_fragment_divide_single/fb_fragment_slice_single/	Vito Caputo
	Mechanical cosmetic change
2017-09-14	ray: switch to the tiling fragmenter	Vito Caputo

2017-09-14	fb: implement a tiling fragmenter	Vito Caputo

2017-09-14	*: use fragment generator	Vito Caputo
	Rather than laying out all fragments in a frame up-front in ray_module_t.prepare_frame(), return a fragment generator (rototiller_fragmenter_t) which produces the numbered fragment as needed. This removes complexity from the serially-executed prepare_frame() and allows the individual fragments to be computed in parallel by the different threads. It also eliminates the need for a fragments array in the rototiller_frame_t, indeed rototiller_frame_t is eliminated altogether.
2017-09-14	util: add MIN/MAX macros	Vito Caputo

2017-09-14	ray: simplify object iterators using sentinel type	Vito Caputo
	Trivial optimization eliminates some instructions from the hot path, no need to maintain a separate index from the current object pointer.
2017-09-13	ray: cleanup ray_camera_frame_t fragments	Vito Caputo
	Previously every fb_fragment_t (and thus thread) was constructing its own ray_camera_frame_t view into the scene, duplicating some work. Instead introduce ray_camera_fragment_t to encapsulate the truly per-fragment state and make ray_scene_render_fragment() operate on just this stuff with a reference to a shared ray_camera_frame_t prepared once per-frame. Some minor ray_camera.c cleanups sneak in as well (prefer multiply instead of divide, whitespace cleanups...)
2017-09-12	ray: don't assume x_alpha is 0 at begin or y_step	Vito Caputo
	Currently fragments always start at the left edge of the frame, but when switching to a tiling fragmenter this is no longer true and causes visible errors.
2017-08-15	ray: misc computational fixups	Vito Caputo
	ray:object intersection coordinates were incorrectly being computed relative to the ray origin using a subtraction instead of addition, a silly mistake with surprisingly acceptable results. Those results were a result of other minor complementary mistakes compensating to produce reasonable looking results. In the course of experimenting with an acceleration data structure it became very apparent that 3d space traversal vectors were not behaving as intended, leading to review and correction of this code.
2017-08-07	rototiller: remove unused variable	Vito Caputo

2017-08-07	ray: more fragments for better thread utilization	Vito Caputo
	For now, a simple cpu multiplier of 64 is used. fb_fragment_t needs a tiling fragment divider added...
2017-08-07	threads: rework threaded fragment scheduling	Vito Caputo
	Instead of creating fragment lists striped across available threads uniformly in a round-robin fashion, just have the render threads iterate across the shared fragments array using atomics. This way non-uniform cost of rendering can be adapted to, provided the module prepares the frame with sufficient fragment granularity. In the ray tracer for example, it is quite common for some areas of the screen to have lower complexity/cost than others. The previous model distributed the fragments uniformly across the threads with no ability for underutilized threads to steal work from overutilized threads in the event of non-uniform cost distributions. Now no attempt to schedule work is made. The render threads simply race with eachother on a per-frame basis, atomically incrementing a shared index into the frame's prepared fragemnts. The fragment size itself represents the atomic work unit. A later commit will change the various renderers to prepare more/smaller fragments where appropriate. The ray tracer in particular needs more and would probably further benefit from a tiling strategy, especially when an acceleration data structure is introduced.
2017-06-03	ray: convert from recursive to iterative tracing	Vito Caputo
	Small speedup, I personally find the code cleaner this way too. Everything in the hot path should now be inlined, no function calls.
2017-06-02	ray: skip intersection tests on reflector objects	Vito Caputo
	We can just assume the object which reflected the ray being tracing isn't going to be intersected. Maybe later this assumption no longer holds true, but it is true for now.
2017-06-02	ray: precompute primary ray for ray_object_sphere_t	Vito Caputo
	This gets rid of some computation on the primary ray:plane intersection tests The branches on depth suck though... I'm leaning towards specialized primary ray intersection test functions.
2017-06-02	ray: precompute primary ray for ray_object_plane_t	Vito Caputo
	This gets rid of some computation on the primary ray:plane intersection tests
2017-06-02	ray: plumb depth and camera to objects	Vito Caputo
	To enable prepare to precompute aspects of primary rays which all have a common origin at the camera, bring the camera to ray_object_prepare() and bring the depth to ray_object_intersects_ray() for primary ray detection. This is only scaffolding, functionally unchanged.
2017-06-02	ray: separate lights from objects	Vito Caputo
	This may need to be undone in the future when more sophisticated lights, like area lights, are implemented. For now I can avoid polluting the objects list with the lights by strictly separating them.
2017-06-02	ray: simplify trace_ray inner loop slightly	Vito Caputo
	Remove unnecessary nearest_object check, the distance comparison alone is sufficient when initialized to INFINITY.
2017-06-01	ray: move shadow check to a function	Vito Caputo
	Just tidying up shade_ray() before more optimizations.
2017-06-01	ray: perform ambient light color scale in prepare	Vito Caputo
	Trivially removes a ray_3f_mult_scalar() from the hot path.
2017-06-01	ray: move max depth check out of trace_ray()	Vito Caputo
	We can avoid some unnecessary work at the max depth by checking it in shade_ray() instead.
2017-05-27	ray: inline ray_object_* switch functions	Vito Caputo

2017-05-27	ray: simplify ray_3f_normalize()	Vito Caputo
	This is functionally identical.
2017-05-27	ray: redo ray_3f_distance()	Vito Caputo
	This function isn't currently used, but its implementation was awful.
2017-05-27	ray: normalize direction missed in 28d8022	Vito Caputo
	Need to normalize the direction when we step the y axis and @ start.
2017-05-27	ray: use approximate power in specular reflection	Vito Caputo
	powf() is slow but precise, this isn't the fastest method but it's at least portable and a bit faster.
2017-05-26	ray: s/nlerp/lerp/ where normalize is unnecessary	Vito Caputo
	It's only necessary to normalize the direction stored vector in x_step(), the rest can simply be linearly interpolated which saves some divides.
2017-05-12	ray: mult normalize in ray_object_sphere_normal	Vito Caputo
	Simple optimization taking advantage of the prepare, mults generally are cheaper than divs.
2017-05-12	ray: add ray_scene_prepare() object precomputing	Vito Caputo
	Just embed a _prepared struct in the object where precomputed stuff can be cached. Gets called once before rendering, which ends up calling the object-specific ray_object_$type_prepare() methods per object.
2017-04-27	*: remove vestigial module/${module}/${module}.h	Vito Caputo
	Prior to rototiller_module_t these headers were included and the module-specific render functions called directly. That's no longer the case, these files are irrelevant today.
2017-04-27	sparkler: enable rudimentary threaded rendering	Vito Caputo
	This moves most of the particle system maintenance into the serially executed sparkler_prepare_frame(), divides the frame into ncpus fragments, and leaves the draw to occur concurrently. The drawing must still currently process all particles and simply skips drawing those falling outside the fragment. Moving more of the computation out of prepare_frame() and into render_fragment() is left for future improvements, as it's a bit complex to do gainfully.
2017-04-27	sparkler: respect fragment->frame_{width,height}	Vito Caputo
	should_draw_expire_if_oob() assumed the fragment represented the entire frame. Instead, return 0 if the coordinates are outside the fragment, but only reset longevity if outside of the frame. If sparkler goes threaded in the drawing, this would result in threads simply skipping particles outside the fragment. The longevity reset occurring in all threads appears suspicious but should be benign since they all write the same thing - 0.