'AI Research Topic/3D Vision' 카테고리의 글 목록

인간을 이해하기 위한 기술 PoseGPT: Chatting about 3D Human Pose에 대하여

PoseGPT: Chatting about 3D Human Pose 오늘은 인간을 이해하기 위한 기술 중 하나인 PoseGPT에 관한 논문을 리뷰하며 드는 생각을 정리해보고자 합니다. PoseGPT는 Max Planck Institute for Intelligent Systems에서 제안한 논문이며, 2023년 11월 30일에 제안된 따끈따끈한 논문입니다. 🔥🚨🧑‍🚒 PoseGPT란 이미지나 텍스트 설명을 통해 3D 인간 포즈를 이해하고 추론하기 위해 LLM을 사용하는 프레임워크라고 할 수 있습니다. 현재까지 진행되어왔던 연구에서는 이미지 또는 텍스트 기반의 인간 자세 추정 문제들은 전체적인 장면을 이해하거나 세세한 추론이 부족해서 시각적 데이터와 실제 자세가 의미하는 바가 연결성이 부족했습니다. 예를..

AI Research Topic/3D Vision

[3D HPS] SMPL-X 모델 관련 자료

SMPL-X 모델은 기존 SMPL 모델(손이나 얼굴에 대한 detail이 떨어지는)에서 확장된 모델이다. 대량의 3D scan data로부터 얼굴과 손까지 포함한 새로운 body model을 구성하였다. 즉, 3가지 모델을 통합하였다. SMPL 모델 + FLAME(head model) + MANO(hand model) 더 내추럴한 결과를 볼 수 있다. joint names 목록은 다음 github에서 볼 수 있다. https://github.com/vchoutas/smplx/blob/main/smplx/joint_names.py SMPL-X 논문은 아래와 같다. https://arxiv.org/abs/1904.05866 Expressive Body Capture: 3D Hands, Face, and Bo..

AI Research Topic/3D Vision

[3D Pose and Shape] Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset

Project page : https://motion-x-dataset.github.io/ Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset We propose Motion-X, a large-scale 3D expressive whole-body motion dataset. Existing motion datasets predominantly contain body-only poses, lacking facial expressions, hand gestures, and fine-grained pose descriptions. Moreover, they are primarily collecte motion-x-dataset.git..

AI Research Topic/3D Vision

[3D Pose and Shape] 3D Human Pose Visualizer

https://github.com/isarandi/poseviz GitHub - isarandi/poseviz: 3D Human Pose Visualizer for Python 3D Human Pose Visualizer for Python. Contribute to isarandi/poseviz development by creating an account on GitHub. github.com

AI Research Topic/3D Vision

[3D HPS] Learning Visibility for Robust Dense Human Body Estimation

이번에 리뷰할 논문은 "Learning Visibility for Robust Dense Human Body Estimation" 입니다. 본 논문에서는 human joint에 대해 visible 정보를 x, y, z 축에 대해 명시적으로 모델링 했습니다. x, y 축의 visible 정보는 frame 밖에 있는 경우를 구별하는데 도움이 되며, z 축의 visible 정보는 self-occlusion 또는 occlusions by other object을 구별하는데 도움이 됩니다. 본 논문에서는 이러한 정보를 이용하여 3d heatmap을 예측합니다. dense heatmap -based representation은 image domain에서 spatial-relationship을 보존하고, uncert..

AI Research Topic/3D Vision

[3D HPS] SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling

Project page : https://maoxie.github.io/SynBody/ SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling Abstract Synthetic data has emerged as a promising source for 3D human research as it offers low-cost access to large-scale human datasets. To advance the diversity and annotation quality of human models, we introduce a new synthetic dataset, Synbody, with ma..

AI Research Topic/3D Vision

[3D HPS] Capturing and Inferring Dense Full-Body Human-Scene Contact

Paper : https://openaccess.thecvf.com/content/CVPR2022/papers/Huang_Capturing_and_Inferring_Dense_Full-Body_Human-Scene_Contact_CVPR_2022_paper.pdf 이번에 읽을 논문은 "Capturing and Inferring Dense Full-Body Human-Scene Contact" 입니다. RICH dataset을 생성하였는데요, 이 데이터의 novelty는 3D scene scan을 통해 human-scene contact를 정확하게 라벨링했다는 것입니다. 아래 그림에서 녹색 부분에 표시되어있습니다. 참고로 이런 분야를 human-scene contact(HSC) 라고 하는데 이는 사람이 주..

AI Research Topic/3D Vision

[3D HPS] BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion

Paper : https://bedlam.is.tuebingen.mpg.de/media/upload/BEDLAM_CVPR2023.pdf Project page : https://bedlam.is.tue.mpg.de/ BEDLAM bedlam.is.tue.mpg.de GitHub : https://github.com/pixelite1201/BEDLAM GitHub - pixelite1201/BEDLAM Contribute to pixelite1201/BEDLAM development by creating an account on GitHub. github.com 이번에 리뷰할 논문은 GitHub에 약 3일전에 업로드된 CVPR 2023에 소개 될 논문 "BEDLAM: A Synthetic Dataset o..

AI Research Topic/3D Vision

[Paper Review] HuMMan, Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling

Paper : https://arxiv.org/pdf/2204.13686.pdf Project page : https://caizhongang.github.io/projects/HuMMan/ HuMMan HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling Zhongang Cai* Daxuan Ren* Ailing Zeng* Zhengyu Lin* Tao Yu* Wenjia Wang* Xiangyu Fan Yang Gao Yifan Yu Liang Pan Fangzhou Hong Mingyuan Zhang Chen Change Loy Lei Yang^ Ziwei Liu^ Shang caizhongang.github.io 오늘 살펴..

AI Research Topic/3D Vision

[3D Reconstruction] CS231A, Computer Vision, From 3D Reconstruction to Recognition

CS231A: Computer Vision, From 3D Reconstruction to Recognition https://web.stanford.edu/class/cs231a/ CS231A: Computer Vision, From 3D Reconstruction to Recognition Can I audit or sit in? In general we are very open to sitting-in guests if you are a member of the Stanford community (registered student, staff, and/or faculty). Out of courtesy, we would appreciate that you first email us or talk t..

AI Research Topic/3D Vision

[Paper Review] Recovering 3D Human Mesh from Monocular Images : A Survey

Paper : https://arxiv.org/abs/2203.01923 Recovering 3D Human Mesh from Monocular Images: A Survey Estimating human pose and shape from monocular images is a long-standing problem in computer vision. Since the release of statistical body models, 3D human mesh recovery has been drawing broader attention. With the same goal of obtaining well-aligned and p arxiv.org Project : https://github.com/tina..

AI Research Topic/3D Vision

[Stereo Vision] 카메라 캘리브레이션의 내부 및 외부 매개변수(intrinsic, extrinsic parameters)

카메라 캘리브레이션(Camera Calibration)이란? 실 세계는 3차원으로 이루어져있지만 이를 카메라로 촬영하게 되면 2차원 이미지로 투영된다. 이 때 실제 3차원 위치 좌표는 이미지 상에서 어디에 위치하는지 기하학적으로 계산할 때 영상을 찍을 당시의 카메라 위치 및 방향에 의해 결정된다. 하지만 실제 이미지는 사용된 렌즈, 대상과의 거리 등의 내부 요인에 영향을 받기 때문에 3차원 위치 좌표는 영상에 투영된 위치를 구하거나 역으로 영상 좌표로부터 3차원 공간좌표를 복원할 때 이러한 내부 요인을 제거해야 정확한 계산이 가능해진다. 이러한 내부 요인의 파라미터 값을 구하는 과정을 카메라 캘리브레이션이라고 한다. 즉, 사진이나 비디오를 촬영하는 실제 카메라 모델을 단순화 시킨 핀홀(Pinhole) 카..

AI Research Topic/3D Vision

Prev 1 2 Next

티스토리툴바