Publications

[Semantics Scholar]
[Production] [Patents] [Technical Reports] [Leetcode]

Venue: ICML, NeurIPS, ICLR, CVPR, MLSys, etc.

Peer-Reviewed Conference Publications
(2019 – present, selected, 1st-author)

Peer-Reviewed Workshop Publications
(2019 – present)

Vision/Field Guide/Review Papers
(2019 – present, selected)

  • (preparing) Private AI, with Intel. 2021
  • A Field Guide to Federated Optimization (with Googlers). 2021
    Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz, Satyen Kale, Sai Praneeth Karimireddy, Jakub Konecny, Sanmi Koyejo, Tian Li, Luyang Liu, Mehryar Mohri, Hang Qi, Sashank J. Reddi, Peter Richtarik, Karan Singhal, Virginia Smith, Mahdi Soltanolkotabi, Weikang Song, Ananda Theertha Suresh, Sebastian U. Stich, Ameet Talwalkar, Hongyi Wang, Blake Woodworth, Shanshan Wu, Felix X. Yu, Honglin Yuan, Manzil Zaheer, Mi Zhang, Tong Zhang, Chunxiang Zheng, Chen Zhu, Wennan Zhu
  • Advances and Open Problems in Federated Learning (with Googlers). 2019
    Peter Kairouz, H Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Keith Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael GL d’Oliveira, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adria Gascón, Badih Ghazi, Phillip B Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, Justin Hsu, Martin Jaggi, Tara Javidi, Gauri Joshi, Mikhail Khodak, Jakub Konečný, Aleksandra Korolova, Farinaz Koushanfar, Sanmi Koyejo, Tancrede Lepoint, Yang Liu, Prateek Mittal, Mehryar Mohri, Richard Nock, Ayfer Özgür, Rasmus Pagh, Mariana Raykova, Hang Qi, Daniel Ramage, Ramesh Raskar, Dawn Song, Weikang Song, Sebastian U Stich, Ziteng Sun, Ananda Theertha Suresh, Florian Tramèr, Praneeth Vepakomma, Jianyu Wang, Li Xiong, Zheng Xu, Qiang Yang, Felix X Yu, Han Yu, Sen Zhao
    [BibTex] [Arxiv] [Jeff Dean’s Tweet] [Publication at Foundations and Trends in Machine Learning]
    Published at FnTML 2020 (Foundations and Trends in Machine Learning, the chief editor is Michael Jordan)
    Highlight: 105 pages, 485 references, 22 Googlers, and 36 academics at 24 institutions!

Preprint

Production

I was a full-stack software engineer and team manager. I have lots of experience in successful Internet product publication. My experience spans nearly all Internet-related software systems development, including cloud computing, distributed systems, machine learning systems, applied machine learning, operating system, and mobile applications. Business domain: cloud computing, automotive intelligent system, autonomous driving, smartphone, Maps/LBS, games, etc

  • Tencent Cloud. 2017-2018
    Worked on business-oriented cloud computing solution; led a team to develop the 1st generation cloud computing solution for the automotive industry.
  • Tencent Venus Distributed Machine Learning System. 2016-2017
    Worked with Shengping Zhou (now CTO@AlphaCloud), developing the large-scale distributed machine learning system for MIG, Tencent. This platform supports various machine learning algorithms and models, including, LR, SVM, GBDT, DNN, etc. One more thing: I have to highlight that one module of Tencent Venus contains the first practice of federated learning system in China, even in the world, earlier than Google FL. At Tencent, we supported vertical federated learning infrastructure for WeBank to model Tencent social network features into their financial models.
  • Tencent Location Data Computing Platform. 2016-2017
    worked on streaming data mining systems and algorithms; this platform contains billion-level daily users; I learned the entire system architecture and related real-time data analytics algorithms.
  • Speech Recognition and Natural Language Processing for Tencent Automotive Operating System. 2015-2016
    This project inspired me to explore ML models, algorithms, and systems; deep learning is very fancy that year; we developed a deep learning model compression engine for our product (at that time, TensorFlow Lite is not released yet)
  • Tencent Games: a Pokemon Go-like Mobile Game Engine 2016
    worked on Unity3D and C++ -based game engine; this is a cross-group collaboration, very fun; I learned a lot of engineering and production culture at Tencent Game.
  • Tencent AI in Car (also called Tencent Automotive Service). 2014-2016
    I was leading an operating system team; worked on the embedded operating system (Android/QNX) and related back-end cloud services, including more than 10 applications and system core service, such as WeChat MyCar, Tencent Maps, Tencent Music, Tencent Video, etc. Please watch the video below to have a taste how Chinese Internet companies advertise their products:-)
  • As a Team Leader, I Developed the 1st generation of Smartphone Navigation SDK, Application, and Service in China (Baidu Navigation SDK). 2012-2013
    Baidu Navigation SDK, Application, and Service is a cross-platform (C++/Java/Objective-C) engine that requires computer graphics (OpenGL ES), routing algorithms (A*, CH), offline and online data compiler and networking service (Hadoop; C++/PHP backend), complex navigation UI message state control and dispatch, cross-platform, layered, and modularized complex system design. I am familiar with all of these core modules.
  • As a Software Engineer, I Contributed to the First Generation of Huawei Smartphone. 2011-2012
    The picture below shows Huawei Ascend P6 (Emotion UI)

Patents

Technical Report

My industrial experience spans cloud computing, distributed system, applied machine learning, data computing platform, mobile computing (Android/iOS/IoT devices), and their applications in AI, Speech Recognition, Games, Maps, and Navigation.

  • 2018 – Data-Driven Cloud Computing Platform and Machine Learning System for the Internet of Vehicles
  • 2016 – The Large-Scale Distributed System and Real-Time Location Data Mining
  • 2017 – TAI: An Intelligent Operating System and Open Platform
  • 2016 – Speech Recognition System for Connected and Autonomous Driving Car
  • 2016 – WeGameMap: Real World Map Rendering Engine for Location-Based Mobile Game
  • 2015 – WeLink: a High-Performance Vehicle-Mobile Networking Library
  • 2014 – TMAP: A System Framework for Mobile Maps and Navigation
  • 2016 – Efficient Spatial Anti-Aliasing Rendering for Line Joins on Vector Maps
  • 2014 – WeCross: a Mobile and Vehicle C/C++ Cross-Platform Library
  • 2017 – A High Precision Private Car Trajectory Dataset and an Open Source Location Data Computing Platform
  • 2017 – A Open Source High Reliable Location SDK for the Automotive Industry
  • 2013 – The First Generation Mobile Navigation SDK Design in China
  • 2012 – Map State Switching under Multi-trigger Condition Based on Finite State Machine Design Pattern
  • 2012 – A General Downloader Engine for iOS/Android Platform
  • 2012 – The Best Practice for Java Native Interface (JNI) Development on Android Operating Platform
  • 2014 – Taxi-Calling Platform and the Open Source Code
  • 2015 – A UX design of Bluetooth Steering Wheel Wireless Controller and the communication system design
  • 2016 – Sliding Window Algorithm Template to Solve All the Leetcode Substring Search Problem.

Leetcode

An Algorithmic Source Code Template for Solving Many Substring Search Problems, 2016
(1.8K upvote, 2000+ stars, 167.8K views)