Calling developers: write your own first line of code for the metaverse!

��Ԫ
December 12, 2022, 01:00


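As interest in the metaverse grows, extended reality (XR) is developing alongside it. For researchers, however, working on XR still means spending considerable time on environment setup, on the differing low-level conventions of individual codebases, and on chaining those codebases together.

In both academia and industry, XR already covers a rich range of algorithm research directions, and some of these algorithms are already in use.

The open-source ecosystem for these algorithms is fragmented, though: individual codebases have little in common, and there is no unified foundation layer beneath them.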

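Image source: Arplanet

From 0 to 1

To address these problems and to make research, development, and integration of XR algorithms more efficient, Shanghai AI Laboratory, Zhejiang University, and SenseTime jointly released the OpenXRLab open-source XR platform at the World Artificial Intelligence Conference on September 1, 2022.

OpenXRLab is positioned as a one-stop open-source platform for XR algorithms, with the stated vision of putting XR within everyone's reach.

The platform currently has three main features:

First, a single base library provides a unified foundation layer and can be called from both C++ and Python;

Second, it offers fairly broad algorithm coverage: the first open-source release spans 3 application platforms and 6 algorithm toolboxes;

Third, the design is modular, so each component can be used on its own or chained with others.

Platform website: https://openxrlab.org.cn/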

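1+3 platform architecture

The OpenXRLab open-source platform can be further divided into four sub-platforms: 1 base platform and 3 application algorithm platforms.

The XR base platform provides the upper layers with a unified interface, efficient computation, and deployment-friendly tooling.

The 3 application algorithm platforms on top each focus on a different area while remaining interconnected:

The XR spatial computing platform is about perceiving and understanding space;
The XR multimodal human-computer interaction platform serves as the tool between people and machines;
The XR rendering and generation platform builds on that perception and interaction to render and generate content.

XR sits on a wide variety of sensors and different operating systems at the bottom, and supports rich applications such as AR/VR and digital humans at the top. OpenXRLab sits between the two, so that researchers can develop new algorithms or quickly build application prototypes.

To that end, the team has open-sourced 1 base library and 6 algorithm toolboxes: 3 for spatial computing (XRSLAM, XRSfM, XRLocalization), 2 for human-computer interaction (XRMoCap, XRMoGen), and 1 for rendering and generation (XRNeRF).

Together, the 3 spatial computing toolboxes also make up what the team describes as the first open-source platform to deliver large-scale, real-time 6DoF localization and AR effects on mobile devices through device-cloud collaboration.

Below is a closer look at the role and features of each library.

The 7 libraries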

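XRPrimer

Project: https://github.com/openxrlab/xrprimer

XRPrimer is a library that provides unified data structures and data-processing interfaces for XR algorithms. Its name is a nod to the classic book C++ Primer.

As the base library, it provides unified data structures and algorithm interfaces, can be called from both C/C++ and Python, offers general-purpose algorithms and efficient computation internally, and exposes extensible external interfaces.

Because the ease of installing a base library directly affects how usable it is, XRPrimer supports building from source on different platforms and also provides prebuilt libraries for multiple platforms to simplify installation.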

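XRSLAM

Project: https://github.com/openxrlab/xrslam

XRSLAM is an open-source SLAM project based on multi-sensor fusion and one of the core modules of the OpenXRLab spatial computing platform.

XRSLAM currently provides a robust, real-time, optimization-based visual-inertial odometry, and supports both desktop and mobile platforms.

Compared with other SOTA systems, XRSLAM is described as highly competitive in both accuracy and efficiency, and as easy to pick up. As infrastructure for AR, the team also provides a mobile application that runs interactively in real time on iPhone.

With a 3D geometric map built in advance by XRSfM, XRSLAM can be combined with XRLocalization to achieve real-time AR localization that combines device and cloud.

Future updates are planned to add a global map and back-end optimization to form a complete visual SLAM system, with support for more sensor types such as stereo/multi-camera and RGB-D.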

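XRSfM

Project: https://github.com/openxrlab/xrsfm

XRSfM is an open-source Structure-from-Motion code repository and part of the OpenXRLab spatial computing platform.

XRSfM recovers a sparse point-cloud structure and the camera pose of each image from image data. The reconstruction can then support scene localization and dense reconstruction.

XRSfM implements an efficient covisibility-based matching method [1] and an efficient keyframe-based reconstruction step, and is reported to hold an advantage in reconstruction speed over other open-source SOTA systems. It also provides scale estimation based on artificial markers, which recovers real-world scale.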
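Marker-based scale recovery can be illustrated with a short sketch. An SfM reconstruction is only defined up to an unknown scale, so if the true distance between two reconstructed marker points is known, the ratio of true to reconstructed distance gives the factor that converts the model to metres. The function names, coordinates, and the 0.2 m marker size below are all hypothetical and chosen for illustration; this is not XRSfM's code or API.

```python
import math


def estimate_scale(marker_a, marker_b, real_distance_m):
    """Return the factor that maps reconstruction units to metres.

    marker_a, marker_b: reconstructed 3D positions of two marker points.
    real_distance_m: their measured real-world distance (assumed known).
    """
    reconstructed = math.dist(marker_a, marker_b)
    if reconstructed == 0:
        raise ValueError("marker points coincide; scale is undefined")
    return real_distance_m / reconstructed


def apply_scale(points, scale):
    """Scale 3D points (or camera centres) about the origin."""
    return [tuple(scale * c for c in p) for p in points]


# Hypothetical data: two marker corners that are 0.2 m apart in reality
# but 0.5 units apart in the reconstruction.
corner_a = (0.0, 0.0, 0.0)
corner_b = (0.5, 0.0, 0.0)
scale = estimate_scale(corner_a, corner_b, real_distance_m=0.2)
metric_points = apply_scale([(1.0, 2.0, 3.0)], scale)
```

In practice several marker measurements would usually be averaged to reduce noise; a single pair is used here to keep the idea visible.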

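XRLocalization

Project: https://github.com/openxrlab/xrlocalization

XRLocalization is a visual localization toolbox based on high-precision maps and part of the OpenXRLab spatial computing platform.

XRLocalization uses a modular design and provides a hierarchical visual localization algorithm, which allows efficient, accurate, and robust localization in large-scale scenes.

The framework supports different feature detection, feature matching, and image retrieval methods, as well as offline and online localization modes.

For local features it currently supports SuperPoint [2] and D2Net [3]; for image retrieval, NetVLAD [4]; and for feature matching, the GAM [5, 6] algorithm.

The project provides basic modules and algorithms together with a visual localization pipeline, offering a code base for both academic research and industrial applications.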
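The image retrieval stage of a hierarchical localization pipeline can be sketched as follows. Each map image is summarized by a global descriptor (NetVLAD plays that role in the toolbox), and the query is compared against all of them to shortlist the most similar map images before local feature matching. The three-dimensional descriptors, file names, and function names here are made up for illustration and are not XRLocalization's API.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two descriptor vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve_top_k(query_desc, database, k=2):
    """Return the names of the k map images most similar to the query.

    database: dict mapping image name -> global descriptor.
    """
    scored = [(cosine_similarity(query_desc, desc), name)
              for name, desc in database.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:k]]


# Hypothetical toy descriptors; real global descriptors have thousands of dimensions.
database = {
    "map_000.jpg": [1.0, 0.0, 0.0],
    "map_001.jpg": [0.9, 0.1, 0.0],
    "map_002.jpg": [0.0, 1.0, 0.0],
}
candidates = retrieve_top_k([1.0, 0.05, 0.0], database, k=2)
```

Only the shortlisted images then go through the more expensive local feature matching and pose estimation, which is what keeps a hierarchical approach tractable on large maps.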

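XRMoCap

Project: https://github.com/openxrlab/xrmocap

XRMoCap is a multi-view motion capture toolbox and part of the OpenXRLab multimodal human-computer interaction platform.

XRMoCap currently has 3 main features:

First, it supports multi-camera motion capture for both single-person and multi-person scenes. It accepts two or more calibrated views as input and provides a series of efficient strategies for selecting cameras and keypoints. The single-person pipeline comes from the original HuMMan [7] team.

Second, it supports both 3D keypoints and human body models, the two main representations in current use, and provides optimization algorithms for converting between them.

Third, it brings optimization-based and learning-based algorithms together in one framework, supporting MvPose [8], MvPose Tracking [9], MvP [10], 4D Association [11], and other algorithms. Users can quickly build and test a multi-view motion capture prototype by editing a configuration file.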

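XRMoGen

Project: https://github.com/openxrlab/xrmogen

XRMoGen is a toolbox for multimodal human motion generation and part of the OpenXRLab multimodal human-computer interaction platform. It currently takes music-to-dance generation as its starting point for a human motion generation codebase.

XRMoGen has 3 main features:

First, the code structure is clear and readable, and fairly detailed usage documentation is provided.

Second, the framework is easy to pick up. Because motion generation code tends to be complex, XRMoGen tries to unify the code of different algorithms under one framework and abstract over them, so that users can extend it easily.

Third, it reproduces two algorithms in this area: DanceRevolution [12] and Bailando [13]. The Bailando [13] implementation comes from the original team, with results on par with SOTA and a code structure that is easy to extend.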

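XRNeRF

Project: https://github.com/openxrlab/xrnerf

XRNeRF is a general-purpose, modular neural rendering framework based on PyTorch and part of the OpenXRLab rendering and generation platform.

XRNeRF implements 4 scene-oriented and 3 human-oriented state-of-the-art NeRF algorithms. The scene algorithms are NeRF [14], Mip-NeRF [15], KiloNeRF [16], and Instant-NGP [17]; the human algorithms are NeuralBody [18], AnimNeRF [19], and GNR [20], with GNR [20] supported as the original implementation.

To address the low modularity and the difficulty of secondary development in current open-source NeRF codebases, XRNeRF offers a high degree of modularity, a standard data-processing pipeline, and modular network construction.

Editing the configuration file is enough to change the data-processing pipeline and the network construction, which makes experiments and algorithm development convenient.

While adopting an easy-to-use, extensible modular design, XRNeRF on average matches the metrics of the official code for each algorithm, with comparable visual results.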
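The NeRF-family algorithms listed for XRNeRF share one core step, volume rendering along a camera ray, which the original NeRF paper [14] defines as a weighted sum of sample colors. A minimal pure-Python sketch of that compositing rule follows; the sample values are made up, and the function is illustrative rather than XRNeRF's implementation.

```python
import math


def composite_ray(densities, colors, deltas):
    """Composite samples along one ray, front to back.

    densities: volume density sigma_i at each sample.
    colors:    RGB color c_i at each sample.
    deltas:    distance delta_i between adjacent samples.
    Returns (pixel_rgb, weights), where weight_i = T_i * alpha_i,
    alpha_i = 1 - exp(-sigma_i * delta_i), and T_i is the transmittance
    accumulated before sample i.
    """
    transmittance = 1.0
    pixel = [0.0, 0.0, 0.0]
    weights = []
    for sigma, color, delta in zip(densities, colors, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)
        weight = transmittance * alpha
        weights.append(weight)
        for ch in range(3):
            pixel[ch] += weight * color[ch]
        transmittance *= 1.0 - alpha  # light left after this sample
    return pixel, weights


# Hypothetical ray: empty space first, then a dense green surface that
# hides the blue sample behind it.
pixel, weights = composite_ray(
    densities=[0.0, 50.0, 50.0],
    colors=[(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)],
    deltas=[0.1, 0.1, 0.1],
)
```

The weights never sum to more than 1, and a dense sample near the camera absorbs most of the transmittance, so samples behind it contribute very little to the pixel.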

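From 1 to N

XR hardware is developing rapidly and new algorithms keep emerging. Researchers and developers are following the field closely, and a great deal of work still lies ahead.

Open-sourcing OpenXRLab is only a small step, and the team warmly welcomes everyone to join in and become contributors.

The team says it welcomes contributions in any form: proposing algorithms in the wishlist, reporting problems in the issue list, or submitting changes via PR.

As XR makes its way into ordinary households, the team hopes to see your first line of code there.

Repository: https://github.com/openxrlab

The team also plans to release more detailed platform introductions and tutorials.

References: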

[1] Ye, Z., Zhang, G., & Bao, H. (2020, May). Efficient covisibility-based image matching for large-scale SfM. In 2020 IEEE International Conference on Robotics and Automation (ICRA) (pp. 8616-8622). IEEE.

[2] DeTone, D., Malisiewicz, T., & Rabinovich, A. (2018). Superpoint: Self-supervised interest point detection and description. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops (pp. 224-236).

[3] Dusmanu, M., Rocco, I., Pajdla, T., Pollefeys, M., Sivic, J., Torii, A., & Sattler, T. (2019). D2-net: A trainable CNN for joint detection and description of local features. arXiv preprint arXiv:1905.03561.

[4] Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., & Sivic, J. (2016). NetVLAD: CNN architecture for weakly supervised place recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5297-5307).

[5] Yu, H., Ye, W., Feng, Y., Bao, H., & Zhang, G. (2020, November). Learning bipartite graph matching for robust visual localization. In 2020 IEEE International Symposium on Mixed and Augmented Reality (ISMAR) (pp. 146-155). IEEE.

[6] Yu, H., Feng, Y., Ye, W., Jiang, M., Bao, H., & Zhang, G. (2022). Improving feature-based visual localization by geometry-aided matching. arXiv preprint arXiv:2211.08712.

[7] Cai, Z., Ren, D., Zeng, A., Lin, Z., Yu, T., Wang, W., ... & Liu, Z. (2022). HuMMan: multi-modal 4D human dataset for versatile sensing and modeling. In European Conference on Computer Vision. Springer, Cham.

[8] Dong, J., Jiang, W., Huang, Q., Bao, H., & Zhou, X. (2019). Fast and robust multi-person 3D pose estimation from multiple views. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 7792-7801).

[9] Dong, J., Fang, Q., Jiang, W., Yang, Y., Huang, Q., Bao, H., & Zhou, X. (2021). Fast and robust multi-person 3d pose estimation and tracking from multiple views. IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10] Zhang, J., Cai, Y., Yan, S., & Feng, J. (2021). Direct multi-view multi-person 3d pose estimation. Advances in Neural Information Processing Systems, 34, 13153-13164.

[11] Zhang, Y., An, L., Yu, T., Li, X., Li, K., & Liu, Y. (2020). 4D association graph for realtime multi-person motion capture using multiple video cameras. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 1324-1333).

[12] Huang, R., Hu, H., Wu, W., Sawada, K., Zhang, M., & Jiang, D. (2020). Dance revolution: Long-term dance generation with music via curriculum learning. arXiv preprint arXiv:2006.06119.

[13] Siyao, L., Yu, W., Gu, T., Lin, C., Wang, Q., Qian, C., ... & Liu, Z. (2022). Bailando: 3D dance generation by actor-critic GPT with choreographic memory. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 11050-11059).

[14] Mildenhall, B., Srinivasan, P. P., Tancik, M., Barron, J. T., Ramamoorthi, R., & Ng, R. (2021). NeRF: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1), 99-106.

[15] Barron, J. T., Mildenhall, B., Tancik, M., Hedman, P., Martin-Brualla, R., & Srinivasan, P. P. (2021). Mip-NeRF: A multiscale representation for anti-aliasing neural radiance fields. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 5855-5864).

[16] Reiser, C., Peng, S., Liao, Y., & Geiger, A. (2021). Kilonerf: Speeding up neural radiance fields with thousands of tiny mlps. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 14335-14345).

[17] Müller, T., Evans, A., Schied, C., & Keller, A. (2022). Instant neural graphics primitives with a multiresolution hash encoding. arXiv preprint arXiv:2201.05989.

[18] Peng, S., Zhang, Y., Xu, Y., Wang, Q., Shuai, Q., Bao, H., & Zhou, X. (2021). Neural body: Implicit neural representations with structured latent codes for novel view synthesis of dynamic humans. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 9054-9063).

[19] Peng, S., Dong, J., Wang, Q., Zhang, S., Shuai, Q., Zhou, X., & Bao, H. (2021). Animatable neural radiance fields for modeling dynamic human bodies. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 14314-14323).

[20] Cheng, W., Xu, S., Piao, J., Qian, C., Wu, W., Lin, K. Y., & Li, H. (2022). Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis. arXiv preprint arXiv:2204.11798.

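Reposted from the source's WeChat official account. Copyright remains with the original author; the content does not constitute investment advice of any kind from this site.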
