I am broadly interested in Machine Learning, especially in Natural Language Processing, Computer Vision, and Deep Learning.
Thrilled to start my career journey at Duolingo as a Software Engineer!
Education
Columbia University, New York
Master of Science in Computer Science || September 2021 - December 2022 Cloud Computing & Big Data Intro to Database Natural Language Processing Operation Systems I Computer Vision II Analysis of Algorithms Reinforcement Learning Deep Learning for Computer Vision Applied Deep Learning
University of Michigan, Ann Arbor
B.S.E in Computer Science || September 2019 - May 2021 Data Structure & Algorithm Intro to Computer Security Web Systems Machine Learning Natural Language Processing Computer Vision
Shanghai Jiao Tong University, Shanghai
B.S.E in Electrical and Computer Engineering || September 2017 - August 2021 Programming & Elem. Data Structures Intro to Logic Design Electronic Circuits Discrete Mathematics Honors Mathematics II-IV Probabilistic Methods
Selected Projects
Monocular Depth Prediction [report] || [code] || September 2022 - December 2022
Final project of COMS 4995 006: Deep Learning for Computer Vision || Instructor: Prof. Peter Belhumeur
Implemented convolutional autoencoder, U-Net, and pretrained + decoder models from scratch for monocular depth prediction tasks.
Developed models that can successfully predict the depth map with small incorrectness.
Automated Essay Scoring Web Application [report] || [video] || [code] || September 2021 - December 2021
Final project of COMS 6998: Cloud Computing & Big Data || Instructor: Prof. Sambit Sahu and Dr. Yuan Zhao
Solved the longstanding problem of high cost and low turnaround of current English writing tests. The goal is to shift testing away from standardized bubble tests to tests that evaluate critical thinking, problem-solving, and other 21st century skills.
Designed frontend and backend architecture and utilized AWS services to build the application, including S3, API Gateway, DynamoDB, Lambda Function, Cognito, and Sagemaker.
Deployed a user-friendly website on AWS S3 connecting with 14 pre-trained machine learning models’ endpoints on AWS Sagemaker, that could provide scores and beat rates on users’ submissions. It gives good predictions on essays with high, medium, and low performances.
3D Semantic Segmentation by Deep Learning [video] || [code] || May 2021 - August 2021
Capstone Project at Shanghai Jiaotong University || Instructor: Dr. Mingjian Li & Dr. Hao Sun
Captured real scenes using Lidar sensor and collected corresponding Lidar point cloud data
Improved baseline segmentation model, Cylinder3D, including data encoding, model architecture, and loss function
Established an index constructor and compresses total disk usage to 2TB by serializing term posting lists using UTF-8 encoding and document deltas when writing index chunks from memory to disk
Ranked retrieved pages using fine-tuned heuristics and static page attributes to achieve a >70% precision
Commonsense Reasoning for Natural Language Inference [report] || [slides] || October 2020 - December 2020
Final project of EECS 595: Natural Language Processing || Instructor: Prof. Joyce Chai
Explored large benchmarks and pretrained models like BERT and Roberta. Applied transfer learning and finetuning skills to develop reasoning frameworks for NLI tasks, including Question Answering, Conversation Entailment and Plausible Inference.
Conducted research on the application of Generative Adversarial Networks on image-to-image translation. Reproduced CycleGAN, StarGAN v1, and StarGAN v2 models for artistic image style transferring and performed quantitative evaluation with FID score and classification accuracy.
Work Experience
Software Engineering Intern Duolingo, Inc. || May 2022 - August 2022
Developed unit assignment prototype with Python (for backend) and TypeScript/React (for frontend) on Duolingo for Schools
Implemented API supports for create, edit, and delete unit assignments, built methods for handling the SNS events, triggering push
notifications, and sending student-facing emails, and updated frontend Teacher Dashboard
Cooperated with product design team on implementation details and designed metrics for user onboarding flow experiment
Design and grade assignments, answer students’ questions on piazza, hold office hours
Teaching Assistant (VP141 Physics Lab I)
Shanghai Jiao Tong University || Instructor: Mateusz Krzyzosiak || May 2019 - August 2019
Teaching Assistant (VY100/VY200 Academic Writing I, II)
Shanghai Jiao Tong University || Instructor: Angela Gehling || September 2018 - May 2019
Research Experience
Deep Learning Model for Making Predictions on Shotgun Metagenomic Sequencing Data
Instructed by Prof. Itsik Pe’er and Vanesa Getseva || [slides] || September 2021 - December 2021
Conduct literature research on shotgun metagenomic sequencing data and collect datasets with proper number of subjects, samples, and time intervals.
Deploy a recurrent neural network (RNN) to make predictions on shotgun metagenomic sequencing data and compare its performance to the performance of traditional Markovian models.
Summary Error Evaluation Project
Instructed by Prof. Lu Wang || September 2020 - May 2021
Evaluate summary quality by annotating four types of un-faithful content and conduct research on abstractive document summarization generation models and summary error evaluations.
Lead weekly meeting of deep learning research team where team members collaboratively researching recent techniques and algorithm regarding applying deep reinforcement learning to improving human learning.
Mapleseed: Sensor Network Laboratory
Instructed by Prof. Xiaogan Liang || [report] || January 2020 - April 2021
Real-time image classification on a NVIDIA Jetson Nano GPU board of particles’ images captured by the optical microscopes on a fast drone.