Projects
Commercial Projects
CER System
Python
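A real-time emotion analysis system for bank customer service, built on the Qwen2-Audio 7B multimodal large model. It uses a Dual-Path Architecture that combines Plan A (high-sensitivity VAD) with Plan B (high-precision speaker diarization) to adapt to call recordings of varying quality and scenarios.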
#AI#Emotion Recognition#Banking#Multimodal
Personal Projects
AI Emoji Kitchen Lab
Python
A deep-learning emoji fusion project that uses large models to produce Google Emoji Kitchen-style blends, merging two different emoji into a single new, creative emoji image.
#AI#Image Generation#Emoji#CV
PDF Summary Tool
Python
A Python tool that converts PDF documents into structured text and uses AI models to generate content summaries and extract keywords.
#PDF#Summarization#AI#NLP#YOLO
TeX-QB-Gen
Python
Generates question banks directly from textbooks. It uses OpenRouter's multimodal and text models together with local OCR to extract math problems and solutions from images, web pages, or PDFs, and produces uniformly formatted TeX question banks.
#Education#OCR#VLM#TeX#Question Bank
bibliotheca-runnel
TypeScript
My personal academic portal and digital archive. It is a curated collection of knowledge, spanning informal mathematical notes, classical literary works, and linguistic research.
#Portfolio#Academic#Next.js#Digital Archive