He is also known for his work on a photorealism enhancement system[10][11] and for a critical study of the Hirsch index (h-index), a metric of scientists' work that is widely used in the research community.[12]
Koltun's research at Stanford contributed to the development of data-driven 3D modeling technology in collaboration with Siddhartha Chaudhuri.[17] Chaudhuri's work with Koltun, Evangelos Kalogerakis, and Leonidas Guibas resulted in a SIGGRAPH publication in 2011.[18] As a result, Mixamo licensed the technology from Stanford; Adobe Inc. later acquired Mixamo and further developed Adobe Fuse CC, a 3D computer graphics application that enabled users to create 3D characters.[19] In 2014, Koltun joined Adobe to conduct research in visual computing, with a primary focus on three-dimensional reconstruction.[20]
Koltun left Adobe to join Intel, where he held various positions until 2021, working on the company's R&D projects for Intelligent Systems.[6][4]
Since August 2021, Koltun has been a distinguished scientist at Apple Inc.[16]
Research
At Intel, Koltun contributed to the development of virtual reality simulators for urban autonomous driving, robots, and drones, focusing on deep reinforcement learning with neural networks in virtual environments. The networks learned by trial and error in VR before being transferred to robots or drones for real-world use. This method was applied to ANYmal, a quadrupedal robot that uses proprioceptive feedback for locomotion control.[21][8][22]
Research on urban autonomous driving led Koltun's group to develop the Car Learning to Act (CARLA) project in 2017.[23] CARLA is an open-source simulator, powered by Unreal Engine, that can be used to test self-driving technologies in realistic environments with randomized dangerous situations.[23][24][25] The project was funded by Intel Labs and the Toyota Research Institute.[23][26]
In 2020, inspired by Google Cardboard, Koltun developed OpenBot with the German scientist Matthias Müller.[5][27] It is a software stack that turns Android smartphones into four-wheeled robots capable of navigation, object tracking, and obstacle avoidance. The robot has a 3D-printable chassis that accommodates a controller, LEDs, a smartphone mount, and a USB cable.[27] The system combines an Arduino Nano board, which links the smartphone to motor actuation and the batteries, with an Android app responsible for the integration of data.[5][27] The project was released as open-source software for robotics applications, with the software development kit available on GitHub.[28][29]
Koltun also contributed to the fields of photorealistic 3D view synthesis and rendering. In 2021, the photorealism enhancement system described in "Enhancing Photorealism Enhancement",[30] his work with other researchers at Intel, was tested on the video game Grand Theft Auto V.[31][32][33]
Koltun co-authored research that developed Swift, an autonomous drone system that uses onboard sensors and can match the performance of human world champions in drone racing.[34][35] The system integrates deep reinforcement learning with real-world data, enabling the drone to perform effectively in physical environments.[34]
Hirsch index critique
Koltun has questioned the reliability of the h-index, arguing that its values are inflated by the prevalence of papers with many co-authors. This critique was presented in collaboration with David Hafner.[36][37]
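For illustration, the h-index is the largest number h such that an author has h papers with at least h citations each. The Python sketch below computes it and, as a hypothetical illustration of the co-authorship concern, also computes a variant in which each paper's citations are divided equally among its authors. The function names, the equal-split rule, and the sample record are assumptions made for this example and are not taken from the cited study.

```python
def h_index(citations):
    """Largest h such that at least h papers have >= h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def fractional_h_index(papers):
    """h-index after splitting each paper's citations equally among authors.

    `papers` is a list of (citations, number_of_authors) pairs. The equal
    split is an assumption made for this illustration.
    """
    shares = [citations / authors for citations, authors in papers]
    return h_index(shares)


# Made-up sample record: (citations, number_of_authors)
papers = [(50, 10), (40, 8), (30, 2), (12, 1), (6, 3), (3, 1)]

print(h_index([c for c, _ in papers]))  # 5
print(fractional_h_index(papers))       # 4
```

In this made-up record, the two most-cited papers have many co-authors, so dividing their citations lowers the score from 5 to 4.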
Selected works
Elia Kaufmann, Leonard Bauersfeld, Antonio Loquercio, Matthias Müller, Vladlen Koltun, Davide Scaramuzza; Champion-level drone racing using deep reinforcement learning, Nature, vol. 620, August 2023[34]
Richter, Stephan R.; Abu AlHaija, Hassan; Koltun, Vladlen (2021). "Enhancing Photorealism Enhancement". arXiv:2105.04619 [cs.CV].
Joonho Lee, Jemin Hwangbo, Lorenz Wellhausen, Vladlen Koltun, Marco Hutter; Learning Quadrupedal Locomotion over Challenging Terrain, Science Robotics (2020)[8]
Elia Kaufmann, Antonio Loquercio, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza; Deep Drone Acrobatics, Robotics: Science and Systems (2020)[38]
Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra; Habitat: A Platform for Embodied AI Research, International Conference on Computer Vision (2019)[39]
Chen Chen, Qifeng Chen, Jia Xu, Vladlen Koltun; Learning to See in the Dark, Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, June 2018
Bai, Shaojie; Kolter, J. Zico; Koltun, Vladlen (2018). "An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling". arXiv:1803.01271 [cs.LG].
Zhou, Qian-Yi; Park, Jaesik; Koltun, Vladlen (2018). "Open3D: A Modern Library for 3D Data Processing". arXiv:1801.09847 [cs.CV].
Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio López, Vladlen Koltun; CARLA: An Open Urban Driving Simulator, Conference on Robot Learning (CoRL) 2017
F Yu, V Koltun; Multi-Scale Context Aggregation by Dilated Convolutions, International Conference on Learning Representations (ICLR) 2016
Stephan R Richter, Vibhav Vineet, Stefan Roth, Vladlen Koltun; Playing for Data: Ground Truth from Computer Games, European Conference on Computer Vision (ECCV) 2016
Sergey Levine, Vladlen Koltun; Guided Policy Search, International Conference on Machine Learning (ICML) 2013
Philipp Krähenbühl, Vladlen Koltun; Efficient inference in fully connected CRFs with Gaussian edge potentials, Advances in Neural Information Processing Systems (NIPS) 2011