My  Portrait.

Yinda Zhang


I am a research scientist and manager at Google. My research interests lie at the intersection of computer vision, computer graphics, and machine learning. Recently, I focus on empowering 3D vision and perception via machine learning, including dense depth estimation, 3D shape analysis, 3D scene understanding, and neural rendering. I received my Ph.D. in Computer Science from Princeton University, advised by Professor Thomas Funkhouser. Before that, I received a Bachelor degree from Dept. Automation in Tsinghua University, and a Master degree from Dept. ECE in National University of Singapore co-supervised by Prof. Ping Tan and Prof. Shuicheng Yan.

Feel free to reach out if you are interested in working with us as either FTE or intern!


Email:yindanospamz (at) gmail (dot) com
Find me on:  
YES! YES! YES! YES! YES! YES↑ YES! YES! YES!

2023.08 One paper is accepted by SIGGRAPH Asia.
2023.08 One paper is accepted by TPAMI.
2023.07 Five papers are accepted by ICCV2023.
2023.04 One paper is accepted by CHI 2023.
2023.02 Four papers accepted by CVPR2023.
2022.07 Four papers accepted by ECCV2022.
2022.05 Debut of AR technology from our team on Google I/O 2022.
2022.05 We released Portrait Depth API and 3D Photo live demo on Google I/O 2022. See more in TensorFlow blog post.
2022.04 Two papers are accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).
2022.03 Two papers are accepted by ACM SIGGRAPH 2022.
2022.03 Two papers are accepted by CVPR2022.
2021.12 I am thrilled to announce the arrival of a wonderful baby boy to our home.
2021.11 One paper got accepted by AAAI2022.
2021.07 Five papers are accepted by ICCV2021.
2021.03 Five papers (2 Orals + 3 Posters) are accepted by CVPR2021.
2020.07 Three papers (2 Orals + 1 Posters) are accepted by ECCV2020.
2020.04 Our deep learning depth refinement solution is checked in Pixel4 front-facing camera to support computational photography and AR applications. Learn more here: Google AI blogpost.
2020.02 Pixel2Mesh is accepted by IEEE Transactions of Pattern Analysis and Machine Intelligence.
2020.02 Four papers are accepted by CVPR2020.
2019.12 Our deep learning based solution for image based depth estimation has been deloyed on Google Pixel4 for Portrait Mode. Check this Google AI blogpost for more details.
2019.07 Pixel2Mesh++ is accepted by ICCV2019. Check here for the paper.
2019.03 DeepLidar paper is accepted by CVPR2019.
2018.12 I started working at Google as a Research Scientist.
2018.11 I obtain Ph.D degree from Princeton University.
2018.11 I am awarded as Siebel Scholar Class of 2019.
2018.07 ActiveStereoNet is accepted as oral presentation by ECCV 2018.
2018.07 Pixel2Mesh is accepted by ECCV 2018.


litnerf

LitNeRF: Intrinsic Radiance Decomposition for High-Quality View Synthesis and Relighting of Faces

K. Sarkar, M. Bühler, G. Li, D. Wang, D. Vicini, J. Riviere, Y. Zhang, S. Orts-Escolano1, P. Gotardo, T. Beeler, A. Meka

ACM SIGGRAPH ASIA 2023

[Paper] [Project Webpage]


deepsfm_journal

DeepSFM: Robust Deep Iterative Refinement for Structure From Motion

X. Wei, Y. Zhang, X. Ren, Z. Li, Y. Fu, X. Xue

IEEE Transactions on Pattern Analysis and Machine Intelligence, 10.1109/TPAMI.2023.3307567

[Paper] [Project Webpage]


sg_hand

Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images

THE. Tse, F. Mueller, Z. Shen, D. Tang, T. Beeler, M. Dou, Y. Zhang, S. Petrovic, HJ. Chang, J. Taylor, B. Doosti

International Conference on Computer Vision (ICCV 2023)

[Paper] [Project Webpage]


ar_for_3D_gen

Learning Versatile 3D Shape Generation with Improved AR Models

S. Luo, X. Qian, Y. Fu, Y. Zhang, Y. Tai, Z. Zhang, C. Wang, X. Xue

International Conference on Computer Vision (ICCV 2023)

[Paper]


shape_rep_with_corr

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects

B. Zhang, J. Li, X. Deng, Y. Zhang, C. Ma, H. Wang

International Conference on Computer Vision (ICCV 2023)

[Paper] [Project Webpage]


light_weight_tof_for_slam

Multi-modal neural radiance field for monocular dense slam with a light-weight tof sensor

X. Liu, Y. Li, Y. Teng, H. Bao, G. Zhang, Y. Zhang, Z. Cui

International Conference on Computer Vision (ICCV 2023)

[Paper] [Project Webpage]


HO-NeRF

Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views

W. Qu, Z. Cui, Y. Zhang, C. Meng, C. Ma, X. Deng, H. Wang>

International Conference on Computer Vision (ICCV 2023)

[Paper] [Project Webpage]


monoavatar

MonoAvatar: Learning Personalized High Quality Volumetric Head Avatars from Monocular RGB Videos

Z. Bai, F. Tan, Z. Huang, K. Sarkar, D. Tang, D. Qiu, A. Meka, R. Du, M. Dou, S. Orts-Escolano, R. Pandey, P. Tan, T. Beeler, S. Fanello, Y. Zhang

Computer Vision and Pattern Recognition (CVPR 2023)

[Paper] [Project Webpage]


sine

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

C. Bao*, Y. Zhang*, B. Yang*, T. Fan, Z. Yang, H. Bao, G. Zhang, Z. Cui

Computer Vision and Pattern Recognition (CVPR 2023)

[Paper] [Project Webpage] [Codes]


hybridnerf

Hybrid Neural Rendering for Large-Scale Scenes with Motion Blur

P. Dai*, Y. Zhang*, X. Yu, X. Lyu, X. Qi

Computer Vision and Pattern Recognition (CVPR 2023)

[Paper] [Project Webpage] [Codes]


gradpu

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

Y. He, D. Tang, Y. Zhang, X. Xue, Y. Fu

Computer Vision and Pattern Recognition (CVPR 2023)

[Paper] [Project Webpage] [Codes]


rapsai

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications through Visual Programming

R. Du, N. Li, J. Jin, M. Carney, S. Miles, M. Kleiner, X. Yuan, Y. Zhang, A. Kulkarni, X. Liu, S. Orts-Escolano, A. Kar, P. Yu, R. Iyengar, A. Kowdle, A. Olwal

ACM SIG-CHI 2023

[Paper] [Project Webpage] [Codes]


neumesh

NeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing

B. Yang, C. Bao, J. Zeng, H. Bao, Y. Zhang, Z. Cui, G. Zhang

European Conference on Computer Vision (ECCV 2022)

[Paper] [Project Webpage] [Codes]


delta

DELTAR: Depth Estimation from a Light-weight ToF Sensor And RGB Image

Y. Li, X. Liu, W. Dong, H. Zhou, H. Bao, G. Zhang, Y. Zhang, Z. Cui,

European Conference on Computer Vision (ECCV 2022)

[Paper] [Project Webpage] [Codes and Data]


PRIF

PRIF: Primary Ray-based Implicit Function

B. Feng, Y. Zhang, D. Tang, R. Du, A. Varshney

European Conference on Computer Vision (ECCV 2022)

[Paper] [Project Webpage] [Codes (Coming Soon)]


LORD

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling

B. Jiang, X. Ren, M. Dou, X. Xue, Y. Fu, Y. Zhang

European Conference on Computer Vision (ECCV 2022)

[Paper] [Project Webpage] [Codes]


NR_in_a_room

Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene

B. Yang, Y. Zhang, Y. Li, Z. Cui, S. Fanello, H. Bao, G. Zhang

ACM SIGGRAPH (Transactions on Graphics, 2022)

[Paper] [Project Webpage] [Codes (Coming Soon)]


voluxgan

VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting

F. Tan, S. Fanello, A. Meka, S. Orts-Escolano, D. Tang, R. Pandey, J. Taylor, P. Tan, Y. Zhang

ACM SIGGRAPH (Conference Track, 2022)

[Paper] [Project Webpage] [Codes]


h4d

Human 4D Modeling by Learning Neural Compositional Representation

B. Jiang*, Y. Zhang*, X. Wei, X. Xue, Y. Fu

Computer Vision and Pattern Recognition (CVPR 2022)

[Paper] [Project Webpage] [Codes]


density-aware-compression

Density-preserving Deep Point Cloud Compression

Y. He, X. Ren, D. Tang, Y. Zhang, X. Xue, Y. Fu

Computer Vision and Pattern Recognition (CVPR 2022)

[Paper] [Project Webpage] [Codes]


view-selection-hand

Efficient Virtual View Selection for 3D Hand Pose Estimation

J. Cheng, Y. Wan, D. Zuo, C. Ma, J. Gu, P. Tan, H. Wang, X. Deng, Y. Zhang

AAAI Conference on Artificial Intelligence (AAAI2022)

[Paper] [Project Webpage] [Codes]


hand_pose_cascade_alignment

Recurrent 3D Hand Pose Estimation Using Cascaded Pose-guided 3D Alignments

X. Deng, D. Zuo, Y. Zhang, Z. Cui, J. Cheng, P. Tan, L. Chang, M. Pollefeys, S. Fanello, H. Wang

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), DOI: 10.1109/TPAMI.2022.3159725

[Paper (Coming Soon)] [Project Webpage (Coming Soon)] [Codes (Coming Soon)]


deeppanocontext

DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization

C. Zhang, Z. Cui, C. Chen, S. Liu, B. Zeng, H. Bao, Y. Zhang

International Conference on Computer Vision (ICCV 2021), Oral Presentation

[Paper] [Project Webpage] [Codes]


object_nerf

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

B. Yang, Y. Zhang, Y. Xu, Y. Li, H. Zhou, H. Bao, G. Zhang, Z. Cui

International Conference on Computer Vision (ICCV 2021)

[Paper] [Project Webpage] [Codes]


mdif

Multiresolution Deep Implicit Functions for 3D Shape Representation

Z. Chen, Y. Zhang, K. Genova, S. Fanello, S. Bouaziz, C. Haene, R. Du, C. Keskin, T. Funkhouser, D. Tang

International Conference on Computer Vision (ICCV 2021)

[Paper] [Codes]


deephybridprior

Deep Hybrid Self-Prior for Full 3D Mesh Generation

X. Wei, Z. Chen, Y. Fu, Z. Cui, Y. Zhang

International Conference on Computer Vision (ICCV 2021)

[Paper] [Project Webpage] [Codes]


interhand

Interacting Two-Hand 3D Pose and Shape Reconstruction from Single Color Image

B. Zhang, Y. Wang, X. Deng, Y. Zhang, P. Tan, C. Ma, H. Wang

International Conference on Computer Vision (ICCV 2021)

[Paper] [Project Webpage] [Codes]


humangps

HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences

F. Tan, D. Tang, M. Dou, K. Guo, R. Pandey, C. Keskin, R. Du, D. Sun, S. Bouaziz, S. Fanello, P. Tan, Y. Zhang

Computer Vision and Pattern Recognition (CVPR 2021)

[Paper] [Project Webpage] [Codes]


implicit_scene

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

C. Zhang*, Z. Cui*, Y. Zhang*, B. Zeng, M. Pollefeys, S. Liu

Computer Vision and Pattern Recognition (CVPR 2021)

[Paper] [Project Webpage] [Codes]



4d_human_representation

Learning Compositional Representation for 4D Captures with Neural ODE

B. Jiang*, Y. Zhang*, X. Wei, X. Xue, Y. Fu.

Computer Vision and Pattern Recognition (CVPR 2021)

[Paper] [Project Webpage] [Data, Model, Code] [Video]



spatial_outdoor_lighting

Spatially-Varying Outdoor Lighting Estimation from Intrinsics

Y. Zhu, Y. Zhang, S. Li, B. Shi

Computer Vision and Pattern Recognition (CVPR 2021), Oral Presentation

[Paper]



hitnet

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching

V. Tankovich, C. Hane, Y. Zhang, A. Kowdle, S. Fanello, S. Bouaziz

Computer Vision and Pattern Recognition (CVPR 2021), Oral Presentation

[Paper] [Pretrained Model and Evaluation Code]



dudunet

Du2Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels

Y. Zhang, N. Wadhwa, S. Orts-Escolano, C. haene, S. Fanello, R. Garg

European Conference on Computer Vision (ECCV 2020), Oral Presentation

[Paper] [Project Webpage]



deepsfm

DeepSFM: Structure From Motion Via Deep Bundle Adjustment

X. Wei*, Y. Zhang*, Z. Li*, Y. Fu, X. Xue

European Conference on Computer Vision (ECCV 2020), Oral Presentation

[Paper] [Project Webpage] [Codes]



geolayout

GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes

W. Zhang, W. Zhang, Y. Zhang

European Conference on Computer Vision (ECCV 2020)

[Paper] [Project Webpage] [Matterport3D-Layout Dataset]



PBRNET

PBR-Net: Imitating Physically Based Rendering using Deep Neural Network

P. Dai, Z. Li, Y. Zhang, S. Liu, B. Zeng

IEEE Transactions on Image Processing, 16 Apr 2020, DOI: 10.1109/TIP.2020.2987169.

[Paper]


NPT

Neural Pose Transfer by Spatially Adaptive Instance Normalization

J. Wang, C. Wen, Y. Fu, H. Lin, T. Zou, X. Xue, Y. Zhang

Computer Vision and Pattern Recognition (CVPR 2020)

[Paper] [Project Webpage (Poster, Video)] [Code and Data]



NPT

Deep Implicit Volume Compression

D. Tang, S. Singh, P. Chou, C. Haene, M. Dou, S. Fanello, J. Taylor, P. Davidson, O. Guleryuz, Y. Zhang, S. Izadi, A. Tagliasacchi, S. Bouaziz, C. Keskin

Computer Vision and Pattern Recognition (CVPR 2020), Oral Presentation

[Paper] [Project Webpage] [Video]



NPCR

Neural Point Cloud Rendering via Multi-Plane Projection

P. Dai*, Y. Zhang*, Z. Li*, S. Liu, B. Zeng

Computer Vision and Pattern Recognition (CVPR 2020)

[Paper] [Project Webpage] [Code and Data] [Poster] [Video]



DIST

DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing

S. Liu, Y. Zhang, S. Peng, B. Shi, M. Pollefeys, Z. Cui

Computer Vision and Pattern Recognition (CVPR 2020)

[Paper] [Supplementary Material] [Code and Data] [Project Webpage] [Video] [Poster] [Slides]



Pixel2Mesh++

Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation

C. Wen*, Y. Zhang*, Z. Li*, Y. Fu

International Conference on Computer Vision (ICCV 2019)

[Paper] [Code and Data] [Project Webpage]



DeepLidar

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene from Sparse LiDAR Data and Single Color Image

J. Qiu*, Z. Cui*, Y. Zhang*, X. Zhang, S. Liu, B. Zeng, M. Pollefeys

Computer Vision and Pattern Recognition (CVPR 2019)

[Paper] [Code and Data]



ActiveStereoNet

ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems

Y. Zhang, S. Khamis, C. Rhemann, J. Valentin, A. Kowdle, V. Tankovich, M. Schoenberg, S. Izadi, T. Funkhouser, S. Fanello

European Conference on Computer Vision (ECCV 2018), Oral Presentation

[Paper] [Supplimentary] [Arxiv] [Slides] [Oral Presentation] [Project Webpage]



pixel2mesh

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

N. Wang*, Y. Zhang*, Z. Li*, Y. Fu, W. Liu, Y. Jiang

European Conference on Computer Vision (ECCV 2018)

IEEE Transactions on Pattern Analysis and Machine Intelligence, 02 Apr 2020, DOI: 10.1109/TPAMI.2020.2984232.

[Paper] [Arxiv] [Code] [Project Webpage]



DeepCompletion

Deep Depth Completion of a Single RGB-D Image

Y. Zhang, T. Funkhouser

Computer Vision and Pattern Recognition (CVPR 2018)

[Paper] [Supplimentary] [Arxiv] [Spotlight Presentation] [Project Webpage] [Code]



Matterport3d

Matterport3D: Learning from RGB-D Data in Indoor Environments

A. Chang, A. Dai, T. Funkhouser, M. Halber, M. Niessner, M. Savva, S. Song, A. Zeng, Y. Zhang

International Conference on 3D Vision (3DV 2017)

[Paper] [Arxiv] [Project Webpage]



PBRS

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Y. Zhang, S. Song, E. Yumer, M. Savva, J. Lee, H. Jin, T. Funkhouser

Computer Vision and Pattern Recognition (CVPR 2017)

[Paper] [Supplimentary] [Arxiv] [Project Webpage] [Code]



DeepContext

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Y. Zhang, M. Bai, P. Kohli, S. Izadi, J. Xiao

International Conference on Computer Vision (ICCV 2017)

[Paper] [Supplimentary] [Arxiv] [Project Webpage] [Video Youtube] [Video Download]



Hand3D

Hand3D: Hand Pose Estimation using 3D Neural Network

X. Deng*, S. Yang*, Y. Zhang*, P. Tan, L. Chang, H. Wang

arXiv:1704.02224 [cs.CV] (7 Apr 2017)

[Paper] [Project Webpage]



jointhand

Joint Hand Detection and Rotation Estimation Using CNN

X. Deng, Y. Zhang, S. Yang, P. Tan, L. Chang, Y. Yuan, H. Wang

IEEE Transactions on Image Processing, 04 Dec 2017, DOI: 10.1109/TIP.2017.2779600.

[Paper] [Project Webpage]



LSUN

LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop

F. Yu, A. Seff, Y. Zhang, S. Song, T. Funkhouser, J. Xiao

arXiv:1506.03365 [cs.CV] 10 Jun 2015

[Paper] [Project Webpage]



Turkergaze

TurkerGaze: Crowdsourcing Saliency with Webcam based Eye Tracking

P. Xu, K. A. Ehinger, Y. Zhang, A. Finkelstein, S. R. Kulkarni, J. Xiao

arXiv:1504.06755 [cs.CV] 25 Apr 2015

[Paper] [Supplimentary] [Video Youtube] [Video Download] [Project Webpage]



PanoContext

PanoContext: A Whole-room 3D Context Model for Panoramic Scene Understanding

Y. Zhang, S. Song, P. Tan, and J. Xiao

European Conference on Computer Vision (ECCV 2014), Oral Presentation

[Paper] [Supplimentary] [Oral Presentation] [Conference Talk] [Detailed Video Youtube] [Detailed Video Download (8min)] [Short Video Download (1min)] [Project Webpage]



FrameBreak

FrameBreak: Dramatic Image Extrapolation by Guided Shift-Maps

Y. Zhang, J. Xiao, J. Hays, P. Tan

Computer Vision and Pattern Recognition (CVPR 2013)

[Paper] [Supplimentary] [Project Webpage]



Google, Mountain View

Research Scientist
12/2018--Present

Princeton University, Vision and Robotics Group

Ph.D. in Department of Computer Science
09/2014--11/2018

Google (through AutoRoboto Inc.), Mountain View

Research Intern
09/2017--10/2018

Matterport Inc., Sunnyvale

Research Intern
06/2017--08/2017

Adobe Research, San Jose

Research Intern
06/2016--08/2016

Microsoft Research, Redmond

Research Intern
07/2015--08/2015

National University of Singapore, Singapore

M.Eng. in Department of Electrical and Computer Engineering
01/2010--01/2013

Microsoft Research Asia, Beijing, Vision Computing Group

Research Intern
05/2010--08/2011

Tsinghua University, Beijing

B.Eng. in Department of Automation
08/2005--08/2009


Media Post:


Invited Talk:


Organizer of Workshops & Tutorial:


Reviewer of Conferences:

CVPR, ECCV, ICCV, SIGGRAPH, NIPS, AAAI, BMVC, 3DV, ACCV, ICPR, ICRA


Reviewer of Journals:

PAMI, IJCV, TOG, MVAP, TIP, NEUCOM


Awards:


Teaching:


Previously Monitored Students:


Email:  yindanospamz (at) gmail (dot) com
 
Address:  1600 Amphitheatre Pkwy,
Mountain View, CA, US, 94043