Junbo ZHAO 赵浚博

Junbo ZHAO 赵浚博

Currently pursuing a Master's degree in Computer Science at Artificial Intelligence School, Beijing Normal University

About Me

I am a Computer Science Master's student at Artificial Intelligence School, Beijing Normal University, under the supervision of Professor Hua Huang. I graduated with a Bachelor's degree from Minzu University of China, where I was honored as an Outstanding Graduate of Beijing.

My academic interests include Educational AI, Multimodal LLM, and Computer Vision. I'm passionate about developing AI systems that can enhance learning experiences and solve complex problems through multimodal understanding.

Outside of academia, I enjoy exploring new places, capturing moments through photography, and staying active. I believe that maintaining a balance between intellectual pursuits and personal interests is essential for creativity and well-being.

Hobbies & Interests

Traveling Photography Badminton Apex Legends

Education

Sep 2023 - Jul 2026

Beijing Normal University (Project 985, THE 146)

Master of Computer Science

Advisor: Professor Hua Huang (Dean, Distinguished Young Scholar)

Research Areas: Educational LMMs, Multimodal LLMs, Computer Vision

Major Courses: Computer Graphics, Computer Vision, etc.

Student Work: Teaching Assistant for Linear Algebra class (Mar 2024 – Jul 2024)

Sep 2019 - Jul 2023

Minzu University of China (Project 985)

Bachelor of Computer Science

Honors: Outstanding Graduate of Beijing

Major Courses: Algorithm Analysis, Databases, Data Structures, Software Engineering, Data Mining, etc.

Student Work: President, Minzu University of China Computer Association (Sep 2021 – Jul 2022)

Research Projects

Sep 2024 - Mar 2025

Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information (ICCV 2025, First Author)

Authors: Junbo Zhao1†, Ting Zhang1†, Jiayu Sun1, Mi Tian2, Hua Huang1✉

Institutions: 1Beijing Normal University, 2TAL

Paper Link: https://arxiv.org/abs/2503.05543

Overview: We proposed a novel framework for geometry problem solving that leverages diagrammatic information to resolve textual ambiguities. Our approach combines multi-modal understanding with symbolic reasoning to achieve state-of-the-art performance.

Key Components:

  • Rectifier: Utilizes multi-modal language models (MLLMs) to disambiguate text by incorporating diagrammatic context
  • Verifier: Ensures refined text adheres to geometric rules, effectively reducing model hallucinations
  • Symbolic Solver: Combines neural parsing with symbolic reasoning for robust problem solving

Results: Pi-GPS achieves state-of-the-art results, outperforming previous neural-symbolic methods by nearly 10% on standard benchmarks, including Geometry3K (77.8% on choice tasks) and PGPS9K (69.8% on choice tasks).

Work Experience

May 2025 - Aug 2025

Mininglamp Technology

MLLM Algorithm Intern

Paper Link: https://arxiv.org/abs/2509.17336

Participated in building UI-Agent tasks and visual multimodal planning models, contributing to model training, dataset design, and case analysis.

Main Responsibilities:

  • Capability testing of the base model
  • Construction of high-quality multimodal chain-of-thought (CoT) datasets
  • Conducted SFT and GRPO fine-tuning on the model

Achievements: Improved model accuracy and successfully facilitated the model's online deployment.

Skills

Programming

C Python Java Vue.js

Frameworks

SpringBoot SpringCloud Design Patterns Distributed Development

LLM Research

SFT Data Construction Agent Development SFT GRPO Training Multimodal Models Capability Evaluation

Honors & Awards

Jul 2024

Academic Second-Class Scholarship

Beijing Normal University

Jun 2021

First Prize – Gomoku Project

15th Chinese Collegiate Computer Game Competition

Nov 2020

Second Prize

11th Lanqiao Cup C/C++ Programming Competition, Beijing Division

Jul 2020

First-Class Academic Scholarship

Minzu University of China