Yixin Zhu | PKU
Yixin Zhu | PKU
Home
Topics
Pre-Print
Publications
Demos
Downloads
Teaching
Students
CoRe Lab
Light
Dark
Automatic
Scene Parsing
[ICLR23] Understanding Embodied Reference with Touch-Line Transformer
We study embodied reference understanding, the task of locating referents using embodied gestural signals and language references. …
Yang Li
,
Xiaoxue Chen
,
Hao Zhao
,
Jiangtao Gong
,
Guyue Zhou
,
Federico Rossano
,
Yixin Zhu
PDF
Cite
Code
Poster
Video
Web
北大新闻网
北大AI院官网
北大新工科官微
北大AI院官微
[ICCV21] YouRefIt: Embodied Reference Understanding with Language and Gesture
We study the machine’s understanding of embodied reference: One agent uses both language and gesture to refer to an object to …
Yixin Chen
,
Qing Li
,
Deqian Kong
,
Yik Lun Kei
,
Song-Chun Zhu
,
Tao Gao
,
Yixin Zhu
,
Siyuan Huang
PDF
Cite
Code
Dataset
Poster
Video
Supp
Web
[CVPR21] Learning Triadic Belief Dynamics in Nonverbal Communication from Videos
Humans possess a unique social cognition capability; nonverbal communication can convey rich social information among agents. In …
Lifeng Fan
,
Shuwen Qiu
,
Zilong Zheng
,
Tao Gao
,
Song-Chun Zhu
,
Yixin Zhu
PDF
Cite
Code
Dataset
Video
Supp
[ECCV20] LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Understanding and interpreting human actions is a long-standing challenge and a critical indicator of perception in artificial …
Baoxiong Jia
,
Yixin Chen
,
Siyuan Huang
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Code
Dataset
Supp
Presentation
Web
[ICRA20] Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs
Aiming to understand how human (false-)belief—a core socio-cognitive ability—would affect human interactions with robots, …
Tao Yuan
,
Hangxin Liu
,
Lifeng Fan
,
Zilong Zheng
,
Tao Gao
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Video
Presentation
[NeurIPS19] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Detecting 3D objects from a single RGB image is intrinsically ambiguous, thus requiring appropriate prior knowledge and intermediate …
Siyuan Huang
,
Yixin Chen
,
Tao Yuan
,
Siyuan Qi
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Poster
Supp
[ICCV19] Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
We propose a new 3D holistic++ scene understanding problem, which jointly tackles two tasks from a single-view image: (i) holistic …
Yixin Chen
,
Siyuan Huang
,
Tao Yuan
,
Yixin Zhu
,
Siyuan Qi
,
Song-Chun Zhu
PDF
Cite
Code
Poster
Video
Supp
Web
[NeurIPS18] Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation
Holistic 3D indoor scene understanding refers to jointly recovering the i) object bounding boxes, ii) room layout, and iii) camera …
Siyuan Huang
,
Siyuan Qi
,
Yinxue Xiao
,
Yixin Zhu
,
Ying Nian Wu
,
Song-Chun Zhu
PDF
Cite
Code
Poster
Video
Supp
Web
[ECCV18] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
We propose a computational framework to jointly parse a single RGB image and reconstruct a holistic 3D configuration composed by a set …
Siyuan Huang
,
Siyuan Qi
,
Yixin Zhu
,
Yinxue Xiao
,
Yuanlu Xu
,
Song-Chun Zhu
PDF
Cite
Code
Poster
Supp
Web
[CVPR16] Inferring Forces and Learning Human Utilities From Videos
We propose a notion of affordance that takes into account physical quantities generated when the human body interacts with real-world …
Yixin Zhu
,
Chenfanfu Jiang
,
Yibiao Zhao
,
Demetri Terzopoulos
,
Song-Chun Zhu
PDF
Cite
Code
Dataset
Poster
Video
Short
Presentation
»
Cite
×