My research interests are computer vision, multi modal and reinforcement learning, specifically, learning-based methods for 3D point cloud segmentation, multimodal pretrain methods, etc.
If you find any research interests that we might share, feel free to drop me an email. I am always open to potential collaborations.
Before that, I received my B.Sc degree in Computing Science at Hohai University in 2022, and I received the honour of being the Charming Graduate of Hohai University.
I am the former captain of the ACM team at Hohai University, I have chaired the 10th and 11th Hohai University ACM Programming Competition. Also, I ran hohai online judge website for a year.
I am a former OIer at JiangSu DaFeng Senior High School, during that time, I became interested in computer science.
I was born on May 21th, 2000 in Yancheng, China. My hometown is on the shores of the Yellow Sea, with a national nature reserve, also known as the home of the moose.
Research interests
I am working at the intersection between Computer Vision and Multi Modal, developing new deep learning methods to resolve the challenging problems in 3D Vision or text-image alignment, especially focus on segmentation, Pretrain model and Scene Understanding.
My long-term goal is to improve the application of 3D Vision, benefiting society directly by improving people's living environment.
Much of my research is about inferring the physical world (shape, motion, color, light, etc) from images and 3D raw data. Representative researches are highlighted.
Learning medical visual representations through vision-language pre-training has some challenges, i.e., local alignment lacks interpretability and clinical relevance, and the insufficient internal and external representation learning of image-report pairs.
To address these issues, we propose an Anatomical Structure-Guided (ASG) framework.
Project
Much of my projects is about inferring the physical world (shape, motion, color, light, etc) from images and 3D raw data. Representative projects are highlighted.
We get the 0.905 at the leardboard. And reach the Top 4%.
We gather the wiki pedia knowledge about science questions, and use the bag-of-words model to clean the datas.
Then, we use the sentence transformer to find the similarity between the problen and the cleaned dataset.
Training three large deberta models with different datasets, and combining their features to inferring the right answer.
For images generated from text using Stable Diffusion, we use three models BLIP+CLIP, OFA and ViT. Then, we combine their features to predict the text for a given generated image.
This project is about HUAWEI robots application, the project requires us to assign policies, control scheduling, and path planning for multiple robots in a single map.
Selected awards
• The 2019 ICPC Asia-East Continetnt Final - Bronze Medal (2019)
Scale your device and escape from this geometry storm.
This game reached the "Innovation RK1" and "Theme interpretation RK2" at Game Off 2023
Misc
Japanese🇯🇵:
       
I am trying to learn Japanese now. And I plan to take part in the Japanese N2 exam at 2024 Summer.
Sports🏃♂️:
       
Swimming🏊, swimming is my hobby when I was a kid, and I hit 39‘22s in the 50m backstroke.
       
Go, Badminton🏸️, Flying Disc🥏.
Games🎮:
       
I love to play Pokémon-related games such like PTCG, Pokémon Legends: Arceus, etc.
       
I am a fan of Nintendo. The Legend of Zelda is the best game I think.