Yiheng Zhang (张怡蘅)

PhD Applicant in Computer Science · University of Wisconsin-Madison, Computer Science

  • I am Yiheng Zhang, a senior student at University of Wisconsin-Madison studying computer science, advised by Prof. Grigoris Chrysos. My GPA during the study in University of Wisconsin-Madison was 3.95/4.0.
  • My research interests are machine learning and natural language processing, with a focus on large language models (LLMs), diffusion language models (DLLMs), and trustworthy AI. In particular, my projects focused on teaching small arithmetic tasks to transformer, and remasking skills in diffusion language models.
  • I am currently applying to Ph.D. programs for Fall 2026 and internships for Summer 2026. Please feel free to contact me if you think I would be a good fit. Thank you.

Projects

Large Language Models for Arithmetic Reasoning
Machine Learning · Feb. 2025 - Jun. 2025

Designed and experimented over arithmetic tasks on Large Language Models, resulted in better generalization length

LLMMachine Learning
Diffusion Language Models and Remasking
Machine Learning · May. 2025 - Today

Implemented the code base for diffusion training, currently working on remasking tasks.

Diffusion Language ModelsDeep Learning

Reading Group

Internal reading group for sharing SoTA papers and new ideas.

Time Every Wednesday and Friday
DateThemeMaterial
2025-05-30Understanding Diffusion Models: A Unified PerspectiveLink
2025-06-10DDPM, DDIM and GuidanceLink
2025-06-19Large Language Diffusion ModelsLink
2025-09-10Demystifying Foreground-Background Memorization in Diffusion ModelsLink
2025-09-26Persona Features Control Emergent MisalignmentLink
2025-10-10TraceDet: Hallucination Detection from the Decoding Trace of Diffusion Large Language ModelsLink