Rework core code and implement a new CUDA based GPU backend by bozbez · Pull Request #50 · emanuelev/supereight

bozbez · 2020-06-23T08:50:48Z

Benchmark results

Benchmarking results for the new CUDA backend, with 640x480 computation resolution and 1 cm voxel size (left bars are for SDF, right for OFusion). Note that the tracker has not (yet) been ported and ran on the CPU for all backends.

Summary of changes

Removed se_ prefix from root directories
Added .clang-format and auto formatted all except tools/ and apps/
Cleaned up various build warnings and simplified CMake files
Switched the Makefile to use Ninja
Moved tracker code to its own static library in tracking/
Templated octree on the node and block buffer type
Unified field implementations into voxel_traits
Extracted the octree-manipulating pipeline stages to a new intermediate Backend class in backend/, which compiles to the new static libraries supereight-backend-{openmp, cuda}-{sdf, ofusion}.a
Added allocator template to Image and added new 1D Buffer class
Added CUDA allocator and memory pool
Implemented CUDA accelerated allocation, integration and raycasting, tuned for desktop GPUs
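As a rough illustration of the "intermediate Backend class" idea in the list above, the sketch below fixes the order of the pipeline stages in a common base and delegates each stage to a concrete backend via CRTP. Every name here (`BackendBase` as written, `RecordingBackend`, `DepthFrame`, the `allocate_`/`update_`/`raycast_` hooks) is a hypothetical stand-in and not taken from the PR; the actual interface in backend/ may look quite different.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for an input depth frame (not a type from the PR).
struct DepthFrame {
    std::vector<float> depth; // depth samples in metres, row-major
};

// Common driver: fixes the stage order once and forwards each stage to the
// concrete backend through CRTP, so no virtual dispatch is involved.
template<typename Derived>
class BackendBase {
public:
    // Integration is assumed to be "allocate, then update the field".
    void integrate(const DepthFrame& frame) {
        self().allocate_(frame);
        self().update_(frame);
    }

    void raycast() { self().raycast_(); }

private:
    Derived& self() { return static_cast<Derived&>(*this); }
};

// Toy backend that only records which stages ran; an OpenMP or CUDA backend
// would launch its loops or kernels in these hooks instead.
class RecordingBackend : public BackendBase<RecordingBackend> {
public:
    const std::vector<std::string>& log() const { return log_; }

private:
    friend class BackendBase<RecordingBackend>;

    void allocate_(const DepthFrame&) { log_.push_back("allocate"); }
    void update_(const DepthFrame&) { log_.push_back("update"); }
    void raycast_() { log_.push_back("raycast"); }

    std::vector<std::string> log_;
};

// Runs one frame through the toy backend and returns the recorded stages.
inline std::vector<std::string> run_pipeline_once() {
    RecordingBackend backend;
    backend.integrate(DepthFrame{{1.0f, 2.0f}});
    backend.raycast();
    assert(!backend.log().empty());
    return backend.log();
}
```

With a layout along these lines, the application code only sees `integrate()` and `raycast()`, and each backend/field combination can be compiled into its own static library.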

Overview of new code structure

TODO

Move core buffer implementation from Image to Buffer, and make Image inherit Buffer
Extend Backend interface to include save/load, VTK dumping, etc
Add documentation to backend/
Clean up backend/src/openmp and extract common code to BackendBase and backend/src/common
Re-introduce the tests in core/test and extend for new functions
(Future) Port the CUDA tracking kernels from the slambench repo
(Future) Tune allocation for GPUs other than the GTX 1080 Ti
(Future) Investigate raycast optimizations, including voxel splatting and hardware trilinear interpolation
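The first TODO item (moving the storage into Buffer and making Image inherit from it) could look roughly like the sketch below. This is only an illustration under assumptions: the member names, the row-major layout, and the use of `std::vector` as backing storage are all hypothetical, and a real device buffer would manage its memory through the CUDA allocator rather than a host-side `std::vector`.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// 1D storage, parameterised on an allocator so that a different allocator
// could be substituted. std::vector is used for illustration only.
template<typename T, typename Allocator = std::allocator<T>>
class Buffer {
public:
    Buffer() = default;
    explicit Buffer(std::size_t size) : data_(size) {}

    std::size_t size() const { return data_.size(); }
    void resize(std::size_t n) { data_.resize(n); }

    T& operator[](std::size_t i) { return data_[i]; }
    const T& operator[](std::size_t i) const { return data_[i]; }

private:
    std::vector<T, Allocator> data_;
};

// Image adds only 2D indexing on top of the inherited 1D storage.
template<typename T, typename Allocator = std::allocator<T>>
class Image : public Buffer<T, Allocator> {
public:
    Image(std::size_t width, std::size_t height)
        : Buffer<T, Allocator>(width * height), width_(width), height_(height) {}

    std::size_t width() const { return width_; }
    std::size_t height() const { return height_; }

    // Row-major access (an assumption of this sketch).
    T& at(std::size_t x, std::size_t y) { return (*this)[y * width_ + x]; }
    const T& at(std::size_t x, std::size_t y) const {
        return (*this)[y * width_ + x];
    }

private:
    std::size_t width_;
    std::size_t height_;
};

// Writes through the 2D view and reads back through the 1D base.
inline float image_demo() {
    Image<float> img(4, 3);
    img.at(2, 1) = 0.5f;
    const Buffer<float>& flat = img; // an Image is usable wherever a Buffer is
    assert(flat.size() == 12);
    return flat[1 * 4 + 2];
}
```

The point of the inheritance is that code written against the 1D `Buffer` interface keeps working when handed an `Image`.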

emanuelev

Great work, thanks a lot! I've got a few comments to get a discussion started; let me know what you think.

emanuelev · 2020-08-30T09:14:38Z

-add_subdirectory(se_core)
-add_subdirectory(se_shared)
-add_subdirectory(se_tools)
+set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -std=c++14 -fdiagnostics-color=always -faligned-new")


Is there a reason to set CMAKE_CXX_FLAGS manually here instead of using add_compile_options?

emanuelev · 2020-08-30T09:15:55Z

+all:
 	mkdir -p build/
-	cd build/ && cmake -DCMAKE_BUILD_TYPE=Release \
+	cd build/ && cmake -GNinja -DCMAKE_BUILD_TYPE=Release \


I am not sure adding a dependency on Ninja would be helpful. Perhaps make it optional and fall back to Make if Ninja is not available?

emanuelev · 2020-08-30T09:30:16Z

+    template<typename OctreeT, typename HashType, typename IncF>
+    SE_DEVICE_FUNC static void buildAllocationList(HashType* allocation_list,
+        int reserved, IncF get_idx, const OctreeT& octree,
+        const Eigen::Vector3f& world_vertex, const Eigen::Vector3f& direction,
+        const Eigen::Vector3f& camera_pos, float depth_sample,
+        float noise_factor);


I am not sure about this. Traits classes are usually quite lightweight and convey only type information. I'd break this into at least two classes, for example an ofusion/sdf class that specifies and implements the buildAllocationList and raycast interface.
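One possible reading of this suggestion, sketched under assumptions: `voxel_traits` keeps only type information, while a separate per-field class carries the behaviour. The name `field_ops`, the value layouts, and the placeholder update rules below are all hypothetical; in the real code this second class would be where buildAllocationList and raycast live.

```cpp
#include <cassert>
#include <type_traits>

// Field tags (SDF and OFusion are the PR's field names; the rest is made up).
struct SDF {};
struct OFusion {};

// Assumed voxel payloads, for illustration only.
struct SDFValue { float distance; float weight; };
struct OFusionValue { float occupancy; float timestamp; };

// Lightweight traits: type information and nothing else.
template<typename FieldTag> struct voxel_traits;
template<> struct voxel_traits<SDF> { using value_type = SDFValue; };
template<> struct voxel_traits<OFusion> { using value_type = OFusionValue; };

// Behaviour lives in a separate implementation class per field.
template<typename FieldTag> struct field_ops;

template<> struct field_ops<SDF> {
    using value_type = voxel_traits<SDF>::value_type;

    // Placeholder rule: weighted running average with a capped weight.
    static value_type update(value_type v, float sample, float max_weight) {
        const float w = v.weight;
        v.distance = (v.distance * w + sample) / (w + 1.0f);
        v.weight = (w + 1.0f < max_weight) ? w + 1.0f : max_weight;
        return v;
    }
};

template<> struct field_ops<OFusion> {
    using value_type = voxel_traits<OFusion>::value_type;

    // Placeholder rule: accumulate evidence and remember the timestamp.
    static value_type update(value_type v, float sample, float timestamp) {
        v.occupancy += sample;
        v.timestamp = timestamp;
        return v;
    }
};

// Generic code names the field once and picks up both types and behaviour.
template<typename FieldTag>
typename voxel_traits<FieldTag>::value_type integrate_sample(
    typename voxel_traits<FieldTag>::value_type v, float sample, float param) {
    return field_ops<FieldTag>::update(v, sample, param);
}

// Averages the samples 1.0 and 0.0 into an initially empty SDF voxel.
inline float traits_demo() {
    static_assert(
        std::is_same<voxel_traits<SDF>::value_type, SDFValue>::value,
        "traits expose only the value type");
    SDFValue v{0.0f, 0.0f};
    v = integrate_sample<SDF>(v, 1.0f, 100.0f);
    v = integrate_sample<SDF>(v, 0.0f, 100.0f);
    assert(v.weight == 2.0f);
    return v.distance;
}
```

Splitting things this way would keep `voxel_traits` cheap to include anywhere, while the heavier field-specific code stays in one class per field.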

emanuelev · 2020-08-30T11:49:05Z


-inline float3 voxelToPos(const int3 p, const float voxelSize){
-  return make_float3(p.x * voxelSize, p.y * voxelSize, p.z * voxelSize);
+inline float3 voxelToPos(const int3 p, const float voxelSize) {


This should be deleted; it is a leftover from the pre-Eigen port.

emanuelev · 2020-08-30T12:08:29Z

@@ -0,0 +1,188 @@
+/*


Not sure about moving this to backend. Do you see a way to keep it in the core lib? Ideally core is everything octree-related but denseslam-independent. Backend at this stage is something of a mix, which I'd rather avoid (think of someone who wants to use the lib without the whole denseslam dependency).

bozbez and others added 30 commits January 9, 2020 12:23

Remove se_ prefix from library dirs

96df11f

Clean up various compiler warnings

630a575

Temporarily remove tests

2570748

Improve dir structure and CMake project names

0b350e6

Improve header directory naming, switch to CMake ninja backend

5c41fd3

Fix CMake warnings

ac23007

Enable warnings globally

1e3b1d0

Enable coloured compiler output

3cc2236

Add .clang-format, clean up formatting

d1002ac

Clean tracking and octree code

85e96ea

Remove volume abstraction

f56691f

Remove use of traits_type<>::value_type from denseslam

e5784d1

clang-format denseslam and core

920f163

Move tracking from denseslam/ to tracking/

6bb2ecb

Make Octree template on underlying buffer

7d3b532

Fix dim should be float in meshing (emanuelev#25)

6fda70f

Align memory_pool method names with buffer method names

a61fd38

Move memory_pool to backend/

4ee9302

Wire up backends with CMake

aaa2293

Complete MVP Backend, and switch DenseSLAMSystem to use it

09b3312

Fix file perms

1aa0f2d

Move field implementations to voxel_traits

bd23a51

Align memory pool reserve() semantics with std::vector

ea46408

Align and extract inner loops of buildAllocationList

deb7317

Extract buildAllocationList kernel

80416df

Add CUDA backend template + CMake support

a176ffa

Clean up warnings

3f765e8

Initial integration of CUDA kernel code

7e69bd4

Inital CUDA implementation of projective update

b0f7a2a

Fix usage of EIGEN_DEVICE_FUNC instead of SE_DEVICE_FUNC

2b64f0d

bozbez added 23 commits May 10, 2020 12:51

Optimise GPU allocation

891ac8e

Split integration kernel 1x512 -> 64x8

702179a

Tweak updateBlocks launch bounds

c104a52

Improve block update register usage

3d3c575

Reduce OFusion timestamp precision to float

33fdb91

Switch to Z inner loop for updateBlocks

a62a52b

Reinstate parallel allocation

671f22d

Tweak integration launch bounds

8cb4a0f

Improve integration memory characteristics, use size_t in memory_pool

7ec9451

Fix CUDA memory pool init and raycast

1181d83

Rework tracker for multiple backend support

98d8958

Move memory related headers to core/memory

a95af4a

Align memory/ naming

a0b1fbc

Unify BufferCUDA and Image

35332bf

Preparations for CUDA tracking

37c8e70

Add tum2raw to convert TUM RGB-D datasets

929f874

Fix parallel allocation for OFusion

1627157

Re-run clang-format over main dirs

b95bb72

Fix sign compare warnings

62b9d2e

Remove old comments

b56f850

Prepare for benchmarking

6aa5c1a

Tweak benchmarking scripts

087986f

Calculate allocate params, tweak benchmark scripts

8dd5f4d

bozbez force-pushed the master branch from 8b30198 to 4f27446 Compare July 27, 2020 10:24

bozbez added 2 commits July 27, 2020 12:24

Update README

4f27446

Fix CMake warnings

4bf8848

bozbez changed the title ~~Implement a CUDA backend~~ Rework core code and implement a new CUDA based GPU backend Jul 27, 2020

bozbez marked this pull request as ready for review July 28, 2020 10:56

emanuelev self-assigned this Aug 30, 2020

emanuelev requested changes Aug 30, 2020

bozbez wants to merge 64 commits into emanuelev:devel from bozbez:master
