Qichen Fu

I am a Member of Technical Staff at Anthropic. Previously, I worked as a Machine Learning Engineer at Apple, focusing on efficient LLMs, efficient NeRFs, and 3D hand-object interaction.

In 2022, I obtained my Master's degree in Robotics (MSR) from the Robotics Institute at Carnegie Mellon University. At CMU, I worked with Prof. Kris Kitani on computer vision research, focusing on hand-object interaction.

I obtained dual Bachelor's degrees: a B.S. in Computer Science from the University of Michigan - Ann Arbor and a B.Eng. in Electrical & Computer Engineering from Shanghai Jiao Tong University. At UM, I was advised by Prof. David Fouhey, working on object articulation detection, cloud geographical location prediction, and 3D hand pose forecasting. I was also advised by Prof. Jeffrey A. Fessler on medical image reconstruction with deep learning.

Email: fuqichen1998@gmail.com

Google Scholar / CV / GitHub / LinkedIn

Research Interests

My research interests are computer vision, generative models, and multimodal machine learning. Recently, I have been interested in understanding human activity, reconstructing 3D objects and scenes, and learning to interact with the world. I am also particularly interested in self-supervised and unsupervised learning that exploits prior knowledge such as temporal information, geometry, multimodal consistency, and physical constraints.

Publications

Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registers

Yuxin Wen, Qingqing Cao, Qichen Fu, Sachin Mehta, Mahyar Najibi

arXiv Preprint 2024
[PDF]

Apple Intelligence Foundation Language Models

Author List

arXiv Preprint 2024
[PDF]

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Qichen Fu, Minsik Cho, Thomas Merth, Sachin Mehta, Mohammad Rastegari, Mahyar Najibi

ES-FoMo @ ICML 2024
[PDF]

Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation

Thomas Merth, Qichen Fu, Mohammad Rastegari, Mahyar Najibi

ICML 2024
[PDF]

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Nikhil Bhendawade, Irina Belousova, Qichen Fu, Henry Mason, Mohammad Rastegari, Mahyar Najibi

arXiv Preprint 2024
[PDF]

FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline

Chien-Yu Lin, Qichen Fu, Thomas Merth, Karren Yang, Anurag Ranjan

WACV 2024 (Oral)
[PDF]

eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models

Minsik Cho, Keivan A Vahid, Qichen Fu, Saurabh Adya, Carlo C Del Mundo, Mohammad Rastegari, Devang Naik, Peter Zatloukal

SAGE @ MICRO 2023
[PDF]

Deformer: Dynamic Fusion Transformer for Robust Hand Pose Estimation

Qichen Fu, Xingyu Liu, Ran Xu, Juan Carlos Niebles, Kris M. Kitani

ICCV 2023
[PDF][Project][Code]

Domain Adaptive Hand Keypoint and Pixel Localization in the Wild

Takehiko Ohkawa, Yu-Jhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato

ECCV 2022
[PDF][Project]

Sequential Voting with Relational Box Fields for Active Object Detection

Qichen Fu, Xingyu Liu, Kris M. Kitani

CVPR 2022
[PDF][Project][Code]

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez^†, David Crandall^†, Dima Damen^†, Giovanni Maria Farinella^†, Bernard Ghanem^†, Vamsi Krishna Ithapu^†, C. V. Jawahar^†, Hanbyul Joo^†, Kris Kitani^†, Haizhou Li^†, Richard Newcombe^†, Aude Oliva^†, Hyun Soo Park^†, James M. Rehg^†, Yoichi Sato^†, Jianbo Shi^†, Mike Zheng Shou^†, Antonio Torralba^†, Lorenzo Torresani^†, Mingfei Yan^†, Jitendra Malik

CVPR 2022 (Oral) Best Paper Finalist
[PDF][Project]

A Self-Supervised Deep Model for Focal Stacking

Weizhi Du, Qichen Fu, Zhengyu Huang
* indicates equal contribution

CLEO 2022
[PDF]

EgoAugment for Action Recognition

Xuhua Huang, Ye Yuan, Xingyu Liu, Qichen Fu, Kris M. Kitani

EPIC @ CVPR 2021
[PDF]

Teaching

16-824: Visual Learning and Recognition (Spring 2022)

Teaching Assistant with Prof. Deepak Pathak

Carnegie Mellon University School of Computer Science
January 2022 - May 2022

16-720B: Computer Vision (Fall 2021)

Teaching Assistant with Prof. Kris Kitani

Carnegie Mellon University School of Computer Science
August 2021 - December 2021

EECS 442: Computer Vision (Winter 2020)

Instructional Aide with Prof. Justin Johnson

University of Michigan - Ann Arbor Computer Science Department
January 2020 - April 2020

EECS 442: Computer Vision (Fall 2019)

Instructional Aide with Prof. David Fouhey

University of Michigan - Ann Arbor Computer Science Department
September 2019 - December 2019