Hi, I'm 曾贞鹏 👋
CS (AI) · CPC Member · Passionate about AI & Full-Stack Dev

About

Graduated from Zhejiang Sci-Tech University, CS (AI), ranked 6/157 (top 3.82%), CPC member. Currently pursuing a Master's in Immersive Media Technology at Communication University of China, member of the National Key Lab of Media Convergence · Future Intelligent Audio-Visual Research Center.

Research focus: video analysis, with current emphasis on cognitive analysis of video-level values. Graduate work includes leading a multimodal LLM-based audiovisual value judgment study for next-gen content governance; proposing the training-free CoT reasoning framework TCRS-QA, which won 1st Place (>8B track) at ICCV 2025 Workshop SF20K; and developing a long-video real-time captioning system in collaboration with Southern New Media. Previously contributed as a core member to the national key R&D sub-project on camera position computation (2021YFF0900701-3). Led 5 core research projects as PI during undergrad, spanning deep learning, knowledge graphs, AR/VR and full-stack engineering. 2 invention patents pending.

Won 7 international/national awards (incl. 1 international 1st, 3 national 1st prizes): ICCV 2025 Workshop 1st Place, National Mathematical Modeling 1st Prize (top 0.648%), National Life Science Competition 1st Prize (top 0.19%), etc. Zhejiang Provincial Government Scholarship 2023–2024.

Tech stack: PyTorch / Agent / LLM / VLM · Vue3 / Next.js full-stack · Unity AR/VR · Knowledge Graph (Neo4j) · Video multimodal analysis. English: CET-4 (541) · CET-6 (448).

Eager to Learn · Dare to Think · Act with Skill

Skills

Python
PyTorch
Agent
LLM / VLM
Vue3 / Vite
Next.js / TS / JS / HTML / CSS
Flask
Unity / ARFoundation
BERT / BiLSTM / CRF
EfficientNet / ViT
Neo4j / Cypher
MySQL
Selenium / Scrapy
Matlab / Gurobi
Photoshop / Premiere Pro / Jianying (剪映) / Wancai Animation (万彩动画)
Projects

My Projects

Spanning AI algorithm research, full-stack web development, AR/VR applications and more — multiple results awarded national prizes or provincial grants.

01

Multimodal LLM-Based Value Judgment for Audio-Visual Content

2025.11 - Present

Research on multimodal LLM-based value judgment for audio-visual content, targeting the growing challenge of content governance on short-video platforms. Builds next-gen intelligent content governance algorithms. Key work: (1) daily short-video data monitoring & collection; (2) human-AI collaborative dataset construction for value-oriented fine-grained annotation using Gemini and manual review; (3) compliance analysis of implicit value tendencies in audio-visual content via multimodal LLMs.

Python · PyTorch · Multimodal LLM · Gemini · Value Judgment · Content Governance · Human-AI Annotation · Social Computing
02

TCRS-QA · Training-Free CoT Reasoning for Long-Video Storyline QA · 1st Place @ ICCV 2025 Slamo Workshop

2025.06 - 2026.01

Proposed TCRS-QA, a training-free chain-of-thought reasoning framework for shot-aware storyline QA on long videos. Built on Qwen2.5-VL-7B and Qwen2.5-7B, with novel Perception Encoder (PE), Adaptive Keyframe Sampling (AKS), and narrative rhythm detection modules for zero-shot high-quality QA. Achieved 1st place (LLM-Score) in the public >8B track at ICCV 2025 Slamo Workshop SF20K competition.

Python · PyTorch · Qwen2.5-VL-7B · Qwen2.5-7B · Chain-of-Thought · Video QA · Zero-Shot Learning · Multimodal Reasoning
03

Real-Time Online Long-Video Captioning System

2024.09 - 2025.06

A multimodal learning-based video description generation system supporting research on multimodal fusion and semantic alignment. Built with Vue3+Flask, supports local upload and URL upload (Bilibili/Tencent Video), handles videos up to 30 minutes, generates real-time captions, and exports WebVTT subtitle files and captioned videos. Collaboration established with Southern New Media.

Vue3 · Flask · Python · PyTorch · BLIP-2 · Multimodal Learning · Video Understanding
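To illustrate the WebVTT export step mentioned above, here is a minimal sketch. It is not the project's actual code: the `Caption` class, `format_timestamp`, and `to_webvtt` are hypothetical names, and it assumes captions arrive as (start, end, text) segments measured in seconds.

```python
from dataclasses import dataclass


@dataclass
class Caption:
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str


def format_timestamp(seconds: float) -> str:
    """Render seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def to_webvtt(captions: list[Caption]) -> str:
    """Serialize captions into the text of a .vtt file."""
    lines = ["WEBVTT", ""]  # mandatory header followed by a blank line
    for cap in sorted(captions, key=lambda c: c.start):
        lines.append(f"{format_timestamp(cap.start)} --> {format_timestamp(cap.end)}")
        lines.append(cap.text.strip())
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)


# Toy captions standing in for model output.
demo = [
    Caption(0.0, 4.5, "A presenter walks onto the stage."),
    Caption(4.5, 9.25, "The audience applauds."),
]
vtt_text = to_webvtt(demo)
```

Writing `vtt_text` to a `.vtt` file gives a subtitle track that HTML5 video players can load. The sketch assumes the caption text contains no blank lines and no `-->` sequence, both of which would need escaping or filtering in a real exporter.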
04

Voice Cloning Technology Research & Application for Dubbing

2024.04 - 2024.07
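Dubbing samples (voice type: transcript of the source audio, translated from Chinese)

Crosstalk performance: "No beard like this, and no eyelashes either, ah, just (softly) very long whiskers."
Female ad voice: "Fresh seasonal vegetables and dry-pot sauce, now available at KFC too. Dry-pot vegetables with tender chicken thigh add a side dish to your burger. The KFC Dry-Pot Vegetable Chicken Thigh Burger has now joined the deluxe lunch menu. Life is so full of beauty."
Young boy voice: "I'm from the teahouse, but lately it seems the teahouse has fewer and fewer customers, so I wondered whether there was anything I could do to help. I started looking for shops that serve good tea and ended up here. Oh, I'm... I'm so sorry for saying something strange."
Hip-hop female voice: "Students, I'm Teacher Zeng, a Xiao Qiao who can never be knocked down."
Emotional female voice: "Hmph, don't worry, we can afford it. The novel we wrote together was recently reprinted, and your royalties are quite substantial."
Male voice with background noise: "In the past, generating a business intelligence dashboard could take several hours, but now it can be done in a few minutes."
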
Python · Bert-VITS · Whisper-PPG · RMVPE · TTS · Voice Conversion · Speech Synthesis · ffmpeg
05

Intelligent Virtual Camera Position System for Cloud Performance · National Key R&D Sub-project (2021YFF0900701-3)

2023.10 - 2025.01

Supporting system for the national key R&D sub-project 'Camera Position Computation Based on Shot Language Analysis' (No. 2021YFF0900701-3). As a core member, led shot motion recognition algorithm research including classification model design and training; advanced 3D reconstruction algorithms and completed standardized API encapsulation. Built on Vue3+Flask with three-end collaboration (business, algorithm, Unity) for real-time high-precision virtual camera position computation and visualization.

Vue3 · Flask · Python · Unity · PyTorch · 3D Reconstruction · Camera Motion Recognition · Classification Model · RESTful API
06

Deep Learning-Based DNA Methylation Modification Recognition

2023.03 - 2023.09

Addressed the high equipment requirements and scalability limitations of traditional detection methods for three DNA methylation types. Based on Nanopore sequencing current perturbation, used GAF to convert hundreds of GB of 1D current signals into 2D images; built EfficientNet, SE-ResNet, and ViT models achieving 88% accuracy (AUC=0.96). Results: National 1st Prize in Life Science Competition (top 0.19%), Zhejiang Provincial Xinmiao Talent Program, ZSTU Key Undergraduate Innovation Program, 1 invention patent pending.

Python · PyTorch · ViT · EfficientNet · SE-ResNet · GAF · Nanopore
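The GAF (Gramian Angular Field) step that turns a 1D current signal into a 2D image can be sketched as follows. This is an illustration only: it assumes the summation variant (GASF) and min-max rescaling, neither of which is specified above, and `gramian_angular_field` is a hypothetical helper name.

```python
import numpy as np


def gramian_angular_field(signal: np.ndarray) -> np.ndarray:
    """Return the Gramian Angular Summation Field of a 1D signal."""
    x = np.asarray(signal, dtype=np.float64)
    lo, hi = x.min(), x.max()
    if hi == lo:
        raise ValueError("signal must not be constant")
    # Rescale to [-1, 1] so every sample is a valid cosine value.
    scaled = 2.0 * (x - lo) / (hi - lo) - 1.0
    scaled = np.clip(scaled, -1.0, 1.0)  # guard against rounding drift
    phi = np.arccos(scaled)              # polar-angle encoding of each sample
    # GASF[i, j] = cos(phi_i + phi_j)
    return np.cos(phi[:, None] + phi[None, :])


# Toy window standing in for a segment of nanopore current readings.
window = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
image = gramian_angular_field(window)
```

A window of length n yields an n × n symmetric matrix, which can then be resized and fed to image classifiers such as EfficientNet or ViT. Long signals would typically be split into fixed-length windows first, since memory grows quadratically with window length.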
07

JingGe · Military Knowledge Graph Auto-Construction & Intelligent Reasoning System

2022.12 - 2023.06

Addressed the inefficiency of traditional military knowledge retrieval. Designed three modules: data collection (PDFPlumber+OpenCV+Gaussian filter), extraction & storage (BERT-BiLSTM-CRF/Attention for NER at 93.13% and relation extraction at 93.11%, full pipeline under 5 min), and reasoning (MySQL-Neo4j linkage via APOC, Cypher queries, D3.js visualization). Won National 3rd Prize at the 14th Service Outsourcing Competition (top 9.091%).

Python · BERT · BiLSTM · CRF · Neo4j · MySQL · D3.js · Cypher · OpenCV
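A sequence tagger such as BERT-BiLSTM-CRF emits one label per token, and those labels still have to be grouped into entities before they can be stored in a graph. The sketch below shows that grouping step under stated assumptions: it assumes a BIO tagging scheme, which the description above does not confirm, and the function name, the label set (`WEAPON`, `ORG`), and the English sample sentence are all hypothetical.

```python
def decode_bio(tokens: list[str], tags: list[str]) -> list[tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs."""
    entities: list[tuple[str, str]] = []
    current_tokens: list[str] = []
    current_type = None

    def flush() -> None:
        """Close the entity currently being built, if any."""
        nonlocal current_tokens, current_type
        if current_tokens:
            entities.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            flush()  # a B- tag always starts a new entity
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            current_tokens.append(token)  # continue the open entity
        else:
            flush()  # "O", or an I- tag that does not continue the open entity
    flush()
    return entities


# Hypothetical tagger output for one sentence.
tokens = ["The", "F-16", "Fighting", "Falcon", "is", "built", "by", "Lockheed", "Martin"]
tags = ["O", "B-WEAPON", "I-WEAPON", "I-WEAPON", "O", "O", "O", "B-ORG", "I-ORG"]
entities = decode_bio(tokens, tags)
```

Each resulting pair could become a node in Neo4j. Tokens are joined with spaces here for readability; character-level Chinese text would be joined without a separator.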
08

StarLight · AR-Based Therapeutic Mobile App for High-Functioning Autism

2022.08 - 2022.11

Developed a therapeutic app for high-functioning autism patients using AR robot 'Apiao' as the carrier. Implemented three core modules in Unity/C#: emotion perception (joy/sadness/love), interest cultivation (puzzle/drawing/music/creation), and cognitive imitation. Used ARFoundation SDK for AR scenes, ManoMotion for bare-hand gesture interaction, and facial/emotion recognition to build AR companionship and game incentive mechanisms.

Unity · C# · ARFoundation · ManoMotion · AR · Emotion Recognition
09

Literature Management Platform with Python Web Crawler

2022.04 - 2022.08

Sole developer. Solved the problem of disorganized literature downloads and vertical-domain knowledge retrieval. Used lxml/requests/selenium to crawl CNKI data (handling iframe nesting), with PyMySQL for data updates. Built full-stack with Vite+Vue3, Element-UI, Echarts, WangEditor, and SpringBoot, supporting conditional search, one-click literature info retrieval, annotation/classification/favorites, auto-categorized downloads, and progress tracking. Awarded Outstanding Result at ZSTU Advisor Project.

Python · Selenium · Vue3 · Vite · SpringBoot · Element-UI · Echarts · MySQL
AI Products

Independent AI Applications

7 AI web applications (4 independently developed, 3 collaborative) — academic search, citation verification, literature review, AI conference platform, workshop website and more.

Tech Stack · Next.js → Supabase → Vercel

Next.js · TypeScript · TailwindCSS · Supabase · Vercel · Namesilo · React
01
SubSearch-AI · 🏆 CUC 4th Elite Cup · Sole 1st Prize (Graduate Track)

Academic Subscription & Intelligent Search

Multi-source academic search with AI intent detection, journal/conference filters, academic metrics, and one-click survey reports; personal archive with AI summaries, annotations & visualization export; scholar & keyword subscription with auto email push.

02

Academic Search & Domain Research

High-freedom multi-source search with AI intent detection, academic metrics & one-click survey reports; dataset tracking to locate sources & latest citations; curated hub with top venues, GitHub repos, CCF, HuggingFace, arXiv & AI media links.

PaperOK
03

Intelligent Academic Citation Verification

One-click citation generation in standard formats; AI citation hallucination detection; source forgery detection by cross-referencing original papers; attribution tracing to track the origin of arguments and ideas.

CiteOK
04

ICAIS 2025 AI Academic Development Competition

An academic platform independently developed before ICAIS 2025's AI development competition. Its features precisely align with the conference theme of AI-assisted research: real-time Q&A, multi-turn idea generation, 16-dimension paper review, and research report generation. Also participated in the literature review track, demonstrating forward-looking exploration of AI-assisted scientific research.

LivePaper · ICAIS
05

Official Workshop Website @ ICCV 2025

Independently developed official website for the 9th HANDS Workshop @ ICCV 2025. Theme: 'Observing and Understanding Hands in Action', focusing on multimodal LLMs for hand-related perception tasks. Features invited talks, paper presentations, poster sessions and challenge awards. Co-organized by University of Birmingham, NUS, Meta, Brown University and others. Held Oct 20, 2025 at Hawai'i Convention Center.

HANDS Workshop · ICCV 2025
06

AI-Powered Citation Verification & Source Tracing

Collaborated on an AI academic citation verification platform. Features: AI Source Finder across CrossRef, PubMed, arXiv & Google Scholar; Citation Checker with 95%+ accuracy for detecting fabricated references; AI Research Assistant for real-time guidance. Supports APA/MLA/Chicago/Harvard/IEEE formats with Zotero & Mendeley integration.

Citely
07

AI Literature Review Generation & Research Platform

Collaborated on an AI literature review platform. Features: parallel multi-database search (Google Scholar, PubMed, Semantic Scholar) with auto-optimized queries; AI review generation grounded in real papers with zero hallucinations and drag-and-drop outline editing; export to Word/PDF/LaTeX/Markdown with one-click sync to Overleaf, Zotero, Mendeley & EndNote.

Literfy
Awards

Awards & Certificates

International champion · 6 national awards (incl. 3 first prizes) · 3 software copyrights · multiple provincial honors

National 1st Prize
2022 National Mathematical Modeling Competition · National 1st Prize

National 1st Prize
8th National Life Science Competition · National 1st Prize

National 1st Prize
12th MathorCup Mathematical Modeling Challenge · National 1st Prize

National 2nd
2021 National Mathematical Modeling Competition · National 2nd Prize

National 3rd
14th China College Service Outsourcing Competition · National 3rd Prize

International
2022 MCM (USA) · Honorable Mention

Provincial 1st
2021 National Mathematical Modeling Competition · Zhejiang Provincial 1st Prize

Provincial 1st
15th Life Science Competition · Zhejiang Provincial 1st Prize

Copyright
Software Copyright · Camera Position Computation for Cloud Performance

Copyright
Software Copyright · Crawler-Based Personal Literature Management

Copyright
Software Copyright · Multimodal Learning-Based Video Description Generation

Provincial
18th Zhejiang Province Challenge Cup · Provincial 3rd Prize

Provincial
21st Zhejiang Province Multimedia Competition · Provincial 3rd Prize

Provincial
Zhejiang Province Higher Mathematics Competition · Provincial 3rd Prize

Competitions

Competitions & Awards

9 national/international competitions, covering mathematical modeling, AI algorithms, AR/VR and more.

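  • ICCV 2025 Workshop SF20K

    Public >8B Track · LLM-Score Rank 1st

  • 2022 National Mathematical Modeling Competition

    Top 0.648% Nationally

  • National Life Science Competition

    Top 0.190% Nationally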

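  • MathorCup Mathematical Modeling Challenge

    Top 5% Nationally

  • MCM (USA)

    Top 30% Globally

  • 2021 National Mathematical Modeling Competition

    Top 2.311% Nationally

  • China College Service Outsourcing Competition

    Top 9.091% Nationally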

  • Z

    Zhejiang Province

  • Z

    Zhejiang Province

Activities

Social Activities & Volunteer

Social practice, volunteer service, and honorary roles — growth beyond technology.

Social Practice

Patriotic Poetry Recitation for the Party's Centenary

2021

Participated in the 'Welcoming the Centenary · Gathering Patriotic Hearts' May Fourth patriotic poetry recitation event; awarded 3rd Prize in Social Practice by the college.

Social Practice

Rare Disease Public Welfare Outreach

2022

Participated in a public welfare outreach event for rare disease patients, spreading awareness deep into communities; awarded 3rd Prize in Social Practice by the college.

Social Practice

Tracing Red Memories · Exploring the 'City of Dawn'

2023

Conducted red culture research in revolutionary base areas, excavating and documenting local red history; awarded 3rd Prize in Social Practice by the college.

Volunteer

Patient Navigation Volunteer at Sir Run Run Shaw Hospital

Served as a volunteer at ZJU Sir Run Run Shaw Hospital, assisting patients with navigation and information inquiries.

Volunteer

Cycling Open Championship Event Volunteer

Served as an event volunteer at a cycling open championship, participating in on-site services and order maintenance.

Volunteer

COVID-19 Vaccination Volunteer

Participated in COVID-19 vaccination site volunteer service, assisting with queue management and registration.

Volunteer

'Meet in Hi-Tech' Community Volunteer Activity

Participated in community volunteer service in the Hi-Tech District, contributing to local community building and public welfare.

Honorary Role

Media Dept. Outstanding Staff Star · ZSTU Innovation & Entrepreneurship Association

Served as a staff member in the Media Department of ZSTU's Innovation & Entrepreneurship Association, responsible for event planning and new media operations; awarded Outstanding Staff Star.

Honorary Role

Class President · 9 Consecutive Years

Served as class president for 9 consecutive years, leading daily class management, activity organization, and student coordination, developing strong team leadership and communication skills.

Party Member Development Study Sharing
Party

Rare Disease Public Welfare Volunteer
Volunteer

COVID-19 Vaccination Volunteer
Volunteer

Zhejiang Sci-Tech University Campus
Campus

Tuya Smart Campus Ambassador Event
Media

AR/VR Application Demo
Research

Other

More Highlights

More moments beyond competitions and research.

License Plate Detection

Face Recognition

Contact

Get in Touch

Feel free to reach out via email or WeChat. Always open to interesting projects and opportunities.

Email: 2473910949@qq.com