참고

[1] https://openai.com/blog/dall-e/

[2] Ramesh et al., Zero-Shot Text-to-Image Generation, arXiv, 2021.

[3] NVIDIA GTC 2022, Insights from NVIDIA Research, https://register.nvidia.com/flow/nvidia/gtcfall2022/ attendeeportal/page/sessioncatalog/session/1656017771434001ZAZg

[4] Muller et al., Instant Neural Graphics Primitives with a Multiresolution Hash Encoding, SIGGRAPH, 2022.

[5] Mildenhall et al., NeRF: Representing scenes as neural radiance fields for view synthesis. ECCV, 2020.

[6] Yu et al., PlenOctrees for Real-time Rendering of Neural Radiance Fields, ICCV, 2021.

[7] Yu et al., Plenoxels: Radiance Fields without Neural Networks, CVPR, 2022.

[8] Kim et al., NeuralVDB: High-resolution Sparse Volume Representation using Hierarchical Networks, arXiv, 2022.

[9] Ken Museth, VDB: High-resolution sparse volumes with dynamic topology, TOG, 2013.

[10] NVIDIA GTC 2022, High-resolution Sparse Volume Representation Using Hierarchical Neural Network, https://register.nvidia.com/flow/nvidia/gtcfall2022/attendeeportal/page/sessioncatalog/session/1658516935505001BqIc

[11] Park et al., DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation, CVPR, 2019.

[12] Sitzmann et al., Implicit Neural Representations with Periodic Activation Functions, NeurIPS, 2020.

[13] Tancik et al., Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains, NeurIPS, 2020.

[14] Iqbal et al., Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild, CVPR, 2020.

[15] https://www.youtube.com/watch?v=VzYkvQ9FgBg

[16] Li et al., Learning the depths of moving people by watching frozen people. CVPR, 2019.

[17] Umar Iqbal et al., Hand pose estimation via 2.5D latent heatmap regression, ECCV, 2018.

[18] Ionescu et al., Human3.6M: Large scale datasets and predictive methods for 3D human sensing in natural environments, TPAMI, 2014.