About Me

I am Zhuoran Wu, Senior Computer Vision Scientist in Alcon , Irvine, California.

I worked as Deep Learning Researcher in Standard AI, San Francisco. Our team implements real time deep learning models for person detection, pose estimation, item classification, and action recognition.

I worked as Machine Learning Engineer in Octi.tv, Los Angeles. Our team develops the Computer Vision and Machine Learning technology behind Octi's AugmentedReality video messaging app. Including: Real-time mobile human 3D pose estimation and tracking, semantic instance segmentation, skeletal action recognition and Gaze Tracking.

I worked as Machine Learning Engineer Intern in PingAn Technology, US Research Lab in Summer of 2018. Our team did pedestrian detection project via Faster RCNN and Cascade RCNN.

I graduated from Georgetown University. My research interest is Computer Vision, Deep Learning and Data Mining. I published paper about Face Recognition. I did many projects at Kaggle.

Besides research, I love to make websites. I have now already made more than 20 websites.

I am now an active member in YiGu Studio, which is a team to make ThreeKingdoms Mod for Mount & Blade 2: Bannerload.

Download My Resume

My Skills

  • Machine Learning Deep Learning
  • MLops
  • Data Mining
  • Data Ops
  • Full Stack Engineering
  • Games Making

Research

1. Work Project

Whole lifecycle Real Time Pose Estimation System, 06/2020 - 06/2023

  • Ensure accurate and refined pose estimation by seamlessly integrating Human Feedback into the pose estimation model pipeline. Optimize workflow and maximize productivity through development of a highly efficient full-cycle deep learning system pipeline with MLops. Expand pose dataset to a massive TB level by incorporating fish-eye images and annotations. Initiate productive discussion and delivered tactful negative feedback to manager during half year performance review to prompt subsequent changes in working methods, ultimately resulting in enhanced team performance.
  • Enhanced model inference to an impressive 30 FPS by efficiently loading and running multiple Models on a single GPU.
  • Explored cutting-edge techniques and advancements in field by conducting in-depth research on state-of-the-art pose
    estimation models.
  • Reduced Model training and delivery time significantly by leveraging cost-effective GPUs without compromising on
    performance and accuracy.
  • Fostered transparent communication with clients and internal team, leading to a two-week deadline extension and the
    subsequent on-time completion of the project following the early delivery of a functional demo version.
  • Achieved an impressive 90% person tracking accuracy through successful implementation and deployment of real-time 2D Pose Estimation, 3D Pose Tracking, and person re-identification models for a checkout-free retail system.
  • Resolved disagreement with Team Manager by executing experiments and engaging in discussions to identify a hybrid
    approach, effectively reducing model training and deployment cost for pose estimation and tracking.
  • Related Patents:

2. Computer Vision Project

Face Anti-Spoofing Detection on Mobile Device, 03/2019 - 04/2019

  • Develop real-time Face Anti-Spoofing Algorithm and Mobile System.
  • Research and Develop Face Recognition and Action Recognition Algorithm on Mobile System.

Human Pose Estimation and Tracking on Mobile Device, 04/2019 - 06/2019

  • Related Publication: [Bounding Box Embedding for Single Shot Person Instance Segmentation](https://arxiv.org/pdf/1807.07674v1.pdf)
  • Optimize real-time Deep Learning based Pose Estimation, Tracking and Instance Segmentation algorithm on Mobile System with Flow-based Pose Tracking Algorithm.

Gaze Estimation and Tracking, 05/2019 - 08/2019

  • Develop real-time Deep Learning based Gaze Estimation System with 10ms latency and less than 5 mean degree error on MPIIGaze, UnityEyes datasets.
  • Design and Collect OctiGaze dataset that contains 256,876 images and gaze vectors we collected from 101 participants in both RGB and IR camera.

ECCV 2018 Workshop - WIDER Pedestrian Detection Challenge, 05/2018 – 07/2018

  • Achievement: 1st in Development Phase, 12nd in Final Phase.
  • What I Do: 1) Build deep learning models for Pedestrian Detection tasks with Detectrion and Tensorflow Framework. 2) Implement Cascade RCNN based on Faster RCNN. 3) Implement easy-to-use tools for image and result analysis.
  • Key Words: Object Detection, Pedestrian Detection, Faster RCNN, Cascade RCNN, YOLO, Deep Learning.

3. Data Mining Project in Kaggle

Competition Contributor 01/2016 - NOW

WSDM 2018 WORKSHOP - KKBOX'S CHURN PREDICTION CHALLENGE, 9/18/2017-12/19/2017

  • Achievement: TOP 8%
  • What I Do: 1) Build multiple models to predict whether a user will churn after their membership expires via
    XGBoost, LightGBM, AutoEncoder. 2) Propose a method to do large file feature engineering based on
    “Equivalence Substructure” idea.
  • Key Words: Feature Engineering, Autoencoder, Large Data Processing, Model Stacking

INSTACART MARKET BASKET ANALYSIS, 07/2017- 09/2017

  • Achievement: Bronze Medal, TOP 10%.
  • What I Do: 1) Build models to predict which previously purchased products will be in a user’s next order. 2)
    Use F1 Optimization for post processing.
  • Key Words: Feature Engineering, XGBoost, LightGBM, Model Stacking, F1 optimization.

MERCEDES-BENZ GREENER MANUFACTURING, 07/2017- 07/2017

  • Achievement: TOP 13%
  • What I Do:  1) Build model to predict the time it takes to pass testing.
  • Key Words: Word Embedding, GradientBoostingRegressor, Lasso, LightGBM.

SBERBANK RUSSIAN HOUSING MARKET, 06/2017- 08/2017

  • Achievement: TOP 15%.
  • What I Do: 1) Build model to predict realty house prices
  • Key Words: Feature Engineering, XGBoost, LightGBM, Model Stacking.

QUORA QUESTION PAIRS, 09/2017- 12/2017

  • What I Do: 1) Build a LSTM model to decide whether two questions have similar meaning.
  • Key Words:  Word Embedding, Word2Vec, Representation-based and Interaction-based Model, LSTM-RNN.

4. Research on 3D Face Tracking & Recognition Technology based on Depth Data

Chief Researcher 05/2014- 04/2015

Publication:

Patents:

  • A New Type of Face Deep Data Recognition Algorithm and Application in Intelligent Authority Distribution System, No.: 201510232302.6;
  • One 3-D Face Recognition Algorithm based on Deep Learning SDAE and Application in the Finance Field, No.: 201510145327.2;

5. Research on Non-contact Medical Assistance Detection System of Elderly Parkinson's disease

Chief Researcher, 04/2015-04/2016

Patents:

  • One Human Behavior Recognition Method Based on Manifold Learning, No.: 201510179381.9;
  • Methods of judging fall behaviors and body balance for old people, No.: 201410633427.5;
  • Gait Recognition and Quality Assessment algorithm for elderly Parkinson’s disease based on skeleton data, No.: 201610161393.3;
Publications2/10
Patents5/10
Kaggle Medal1/10

Experience

Currently, I am a Senior Data Scientist / Computer Vision Scientist in Alcon, Irvine, California.
  • Aug 2023 - PRESENT
    Alcon

    Senior Data Scientist / Computer Vision Scientist

  • SEPT.2019 - Aug. 2023
    Standard AI

    Deep Learning Researcher

  • MAR.2019 - AUG.2019
    Octi

    Machine Learning Engineer

  • MAY. 2018 - SETP. 2018
    Ping An Intelligence Institute

    Machine Learning Engineer Intern

  • JAN. 2017 - DEC. 2018
    Georgetown University

    Master Degree, major as Computer Science.

  • Jan. 2016 - Dec. 2016
    QiYou Information Technology Co. Ltd.

    Software Engineer Intern

  • Setp. 2012 - June. 2016
    ChangZhou University

    Bachelar Degree, major as Computer Science

Download Resume

Portfolio

If you want to know more details about my work, feel free to contact me.

Testimonial

Contact

Email munoliver007 [at] gmail [dot] edu

Address20511 Lake Forest Drive, Lake Forest, CA 92630

Interested in working with me? Contact Me Now!