3DV 2017: Waiting for You in Qingdao

2017-09-22 20:56:42






Homepage: http://3dv.org





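In recent years, industries built on 3D vision (hereafter 3DV) as a core technology have emerged continually, with huge market potential and rapid research progress. The fields covered by 3DV include, but are not limited to, 3D acquisition, modeling, and learning, with applications in areas such as autonomous driving, healthcare, and robotics. 3DV 2016 was hosted by Stanford University and attracted more than 500 experts and scholars. The conference has become a major gathering for 3D vision research, prototype systems, commercial products, and talent.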


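3DV 2017 is hosted by Shandong University. The general chair is Professor Baoquan Chen, dean of the School of Computer Science and the School of Software at Shandong University. This year's conference features oral presentations, short presentations, invited talks, and poster sessions, giving attending experts and scholars an ideal platform for in-depth exchange. The schedule appears at the end of this article, or visit: http://irc.cs.sdu.edu.cn/3dv/program.html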


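3DV 2017 accepted 73 papers, representing the latest advances on frontier research problems including 3D reconstruction, 3D deep learning, motion capture, 3D scene understanding, and SLAM.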








Registration grants access to all 3DV conference sessions, including keynotes, proceedings, coffee breaks, and the banquet. For the detailed schedule, see: http://irc.cs.sdu.edu.cn/3dv/program.html













Wen Gao

Professor at Peking University, Academician, ACM/IEEE Fellow



Online visual processing for 3D reconstruction, SLAM, and object recognition



Niloy J. Mitra




Building a Factorized Scene Model: Capturing Appearance, Geometry, and Interactions



Building realistic and accurate scene models of the world, both indoor and outdoor, has long remained a central goal of shape analysis. While it is now easy to capture large volumes of data including images, videos, and scans, converting such data to a factorized representation with semantic annotations remains a major challenge. For example, given an image, can we tell what the objects in the scene are, how they are illuminated, or how they will functionally behave in the presence of external forces? Similar questions apply outdoors.

In this talk, I will discuss some of our recent attempts to factorize raw measurements into scene geometry, appearance, and their interactions. I will discuss how synthetically rendered images can be used to discover object arrangements in photographs, capture real world illumination and texture using geometric proxies, and 'read off' physical object properties by observing them collide in space. Our methods allow for large-scale unsupervised production of richly textured 3D models directly from image data, providing high-quality realistic objects for 3D scene design or photo editing applications, as well as a wealth of data for training machine learning algorithms for various inference tasks in graphics and vision. For more details, data and code, please visit: geometry.cs.ucl.ac.uk.




Long Quan

Professor at the Hong Kong University of Science and Technology, IEEE Fellow



Computer Vision, Visual Learning, and 3D Reconstruction: Modeling the world with drones and smartphones!



Professor Quan leads a computer vision team that uses photographs and deep visual learning technologies to produce complete 3D reconstructions of all types of locations and objects. In this talk, he reviews the developments in computer vision and visual learning over the past two decades. He then turns to recent exciting work in deep visual learning and breakthroughs in 3D reconstruction. He showcases the approach with case studies of large-scale 3D reconstructions, from drones, of hundreds of square kilometers of high-rise metropolitan areas and undeveloped rural areas, and of small-scale everyday objects from smartphones. He also demonstrates the online cloud platform and portal www.altizure.com with its crowd-sourced Altizure Earth, developed and funded by the HKUST team, rivaling the popular Google Earth!



Davide Scaramuzza 




Robust, Visual-Inertial State Estimation: from Frame-based to Event-based Cameras


I will present the main algorithms to achieve robust, 6-DOF state estimation for mobile robots using passive sensing. Since cameras alone are not robust enough against high-speed motion and high-dynamic-range scenes, I will describe how IMUs and event-based cameras can be fused with visual information to achieve higher accuracy and robustness. I will therefore dig into the topic of event-based cameras, which are revolutionary sensors with a latency of microseconds, a very high dynamic range, and a measurement update rate that is almost a million times faster than standard cameras. Finally, I will show concrete applications of these methods in autonomous navigation of vision-controlled drones.







Ruigang Yang

3D Vision Research and Applications at Baidu



At the inaugural AI developer conference “Baidu Create” on July 5, 2017, Baidu announced its all-in-on-AI strategy. Among the AI-based open platforms announced at the conference, Apollo targets autonomous driving. I will talk about the crucial role of 3D vision in the Apollo platform, in particular how 3D vision and deep learning can benefit from each other’s advances. In addition, I will introduce 3D-vision-related research and productization in AR and robotics.





Zhengyou Zhang

Principal Researcher at Microsoft Research Redmond, ACM/IEEE Fellow



3D Computer Vision for Immersive Interaction and Remote Collaboration



We look into human-computer interaction and human-human remote collaboration. In human-computer interaction, multitouch interaction has become increasingly popular because touch input feels more natural than the traditional keyboard and mouse. Consequently, there has been rapid development of multitouch interactive display technologies. The traditional touch interaction, however, is not without its limitations. One fundamental limitation is that the touch is “blind.” The system does not know anything that happens off the board. We propose a system that augments touch input with visual understanding of the user to improve interaction with a large touch-sensitive display. A commodity color plus depth sensor such as Microsoft Kinect adds the visual modality and enables new interactions beyond touch. Through visual analysis, the system understands where the user is, who the user is, which hand the user is using, and what the user is doing even before the user touches the display. Such information is used to enhance interaction in multiple ways. In human-human remote collaboration, the existing videoconferencing systems, whether they are available on desktop and mobile devices or in dedicated conference rooms with built-in furniture and life-sized high-definition video, leave a great deal to be desired: mutual gaze, 3D, motion parallax, spatial audio, to name a few. We propose an immersive telepresence system that aims at bringing immersive experience into telecommunication so people across geographically distributed sites can interact collaboratively as if they were face-to-face. Computer vision, graphics and acoustics are used in capturing and rendering 3D dynamic environments in order to create the illusion that the remote participants are in the same room. Over the years, Microsoft has been conducting research and development of novel technologies to improve users’ experience in multimodal interaction and immersive telecommunications.




DepthSynth: Real-Time Realistic Synthetic Data Generation from CAD Models for 2.5D Recognition



OctNetFusion: Learning Depth Fusion from Data

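The authors propose OctNetFusion, a deep-learning 3D CNN framework that takes multi-view depth images as input and produces accurate, complete 3D reconstructions. Compared with previous depth fusion methods, it handles occlusion effectively. The model produces compelling results on both synthetic depth images and real data captured with Kinect.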



3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks




Real-time Full-Body Motion Capture from Video and IMUs




Efficient Deformable Shape Correspondence via Kernel Matching


For more outstanding papers, see: http://irc.cs.sdu.edu.cn/3dv/program.html









3DV 2017 cordially invites you to attend






October 10, 2017 (Tuesday)


9:00 - 9:15 AM     Opening Remarks

9:15 - 10:00 AM   Keynote 1: Zhengyou Zhang, "3D Computer Vision for Immersive Interaction and Remote Collaboration"

10:00 - 10:30 AM Coffee Break

10:30 - 11:10 AM Oral Session 1

11:10 - 11:30 AM Spotlight Session 1

11:30 - 12:30 PM Poster session 1

12:30 - 2:00 PM   Lunch

2:00 - 2:45 PM     Keynote 2: Davide Scaramuzza, "Robust, Visual-Inertial State Estimation: from Frame-based to Event-based Cameras"

2:45 - 3:10 PM     Coffee Break

3:10 - 3:50 PM     Oral Session 2

3:50 - 5:20 PM     Forum: The challenges and opportunities in 3D sensing

5:20 - 6:20 PM     Poster session 1


October 11, 2017 (Wednesday)



9:00 - 9:15 AM     Announcements

9:15 - 10:00 AM   Keynote 3: Wen Gao, "Online visual processing for 3D reconstruction, SLAM, and object recognition"

10:00 - 10:30 AM Coffee Break

10:30 - 11:10 AM Oral Session 3

11:10 - 11:30 AM Spotlight Session 2

11:30 - 12:30 PM Poster session 2

12:30 - 2:00 PM   Lunch

2:00 - 2:45 PM     Keynote 4: Long Quan, "Computer Vision, Visual Learning, and 3D Reconstruction: Modeling the world with drones and smartphones!"

2:45 - 3:10 PM     Coffee Break

3:10 - 3:50 PM     Oral Session 4

3:50 - 4:25 PM     Spotlight Session 3

4:25 - 5:25 PM     Poster session 2

5:30 - 8:00 PM     Banquet



October 12, 2017 (Thursday)



9:00 - 9:15 AM     Announcements

9:15 - 10:00 AM   Keynote 5: Niloy Mitra, "Building a Factorized Scene Model: Capturing Appearance, Geometry, and Interactions"

10:00 - 10:30 AM Coffee Break

10:30 - 11:10 AM Oral Session 5

11:10 - 11:30 AM Spotlight Session 4

11:30 - 12:30 PM Poster session 3

12:30 - 2:00 PM   Lunch

2:00 - 2:45 PM     Keynote 6: Ruigang Yang, "3D Vision Research and Applications at Baidu"

2:45 - 3:10 PM     Coffee Break

3:10 - 3:50 PM     Oral Session 6

3:50 - 4:20 PM     Spotlight Session 5

4:20 - 5:20 PM     Poster session 3

5:20 - 6:20 PM     Awards, Closing Remarks and 3DV-18


For the full schedule, visit: http://irc.cs.sdu.edu.cn/3dv/program.html