About
A highly analytical and results-driven AI/ML Engineer with a Master's degree in Computer Science and a strong foundation in computer vision, deep learning, and full-stack development. Proven ability to design, develop, and deploy complex AI models and applications, as demonstrated by optimizing stereo matching speed by 1270% and developing an intelligent robotic arm task understanding system with a loss error below 3.1. Eager to leverage expertise in cutting-edge AI frameworks and robust software engineering to drive innovation and solve complex problems in challenging technical environments.
Work
→
Summary
As a Visual Algorithm Researcher, I led advanced R&D in computer vision and AI, delivering innovative solutions for robotic arm control, object detection, and real-time stereo matching with significant performance gains.
Highlights
Developed an intelligent robotic arm task understanding system using RoboTwin and Instructpix2pix, enabling natural language instruction comprehension and precise end-effector trajectory prediction with a loss error below 3.1.
Engineered a YOLOv8-based object detection and aiming system for CSGO and APEX datasets, achieving 85.38% accuracy at 6.3 frames per second while bypassing anti-cheat mechanisms.
Improved real-time stereo matching speed by 1270%, from 3.3fps to 42fps, by integrating the GCE module into PSMnet's cost aggregation and optimizing the CoEx disparity regression module.
→
Summary
As a Software Engineer Intern, I contributed to front-end development and UI component integration for the Water Rabbit Project, enhancing web application functionality and user experience.
Highlights
Encapsulated SDK-Web UI components from Photoshop designs, converting them into layout files and integrating them into the Demo App for styling and configuration testing.
Performed daily website maintenance and page updates, ensuring content timeliness and implementing dynamic animations using CSS3 and JavaScript.
Managed front-end and back-end integration, ensuring seamless data flow and functionality for web applications.
Education
Languages
Mandarin Chinese
English
Russian
Skills
Programming Languages
Python, C++, Java, JavaScript, HTML, CSS.
AI & Machine Learning
Object Detection, Stereo Matching, Perception Algorithms, LLAMA, Agent Models, Image Enhancement, Machine Learning, Visual Prediction, Deep Learning, Large Language Models (LLM), Instructpix2pix, RoboTwin, PSMnet, QWEN-plus, OpenManus.
Web Development
Django, React, Vue.js, Full-stack Development, Front-end Development, Back-end Development, UI Component Encapsulation, WebUI.
Tools & Technologies
Linux, Windows, Ubuntu, Docker, Git, MySQL, OpenCV, MMDetection, Project Deployment, Version Control.
Methodologies
ReAct Framework, Multimodal AI, Problem Solving.