Tianyi Xu
CDIS, University of Wisconsin–Madison
Madison, WI 53706
I’m Tianyi Xu, graduated from the University of Wisconsin–Madison (B.S. in Computer Science, Data Science, and Mathematics). I am advised by Prof. Junjie Hu at UW–Madison. I work closely with Shaobo Wang and Prof. Linfeng Zhang at Shanghai Jiao Tong University, and Prof. Claudia Solís-Lemus at UW–Madison.
I’m interested in building modern AI systems that are efficient, generalizable, and capable of understanding and acting across multiple modalities, especially under limited supervision and real-world constraints. Concretely, my work focuses on:
-
Data-centric & label-efficient learning — Designing methods for data selection, mixing, and self/weak supervision so that large-scale models can learn from noisy, heterogeneous data instead of only clean benchmarks.
-
Foundation models & agents— Building and steering pretrained models (LLMs, VLMs, etc.) for specific tasks, with an emphasis on efficiency, reliability, and agentic behaviors such as planning, reasoning, and safe decision-making.
-
Multimodal learning — Representation learning and building systems that perceive, act, and reason across different modalities.
-
AI for science and society — Applying these ideas to real-world problems where data is scarce or noisy: biodiversity monitoring from soundscapes, tone-aware speech modeling for accessibility, and multimodal toolkits/benchmarks for biomedical and clinical AI.
I’m seeking Ph.D. opportunities for Fall 2026.
selected publications
- ICMLOPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every IterationIn ICML, 2026Under review
- ACLSITA: Learning Speaker-Invariant and Tone-Aware Speech Representations for Low-Resource Tonal LanguagesIn ACL, 2026Under review
- WACVSelf-Supervised Sound Detection with AudioMAE for Robust, Label-Efficient Biodiversity MonitoringIn CV4EO Workshop, WACV, 2026Under review
- Soil Use Manag.
Combined effects of methyl bromide and soil amendments on soil bacterial and fungal communities in turfgrassSoil Use and Management, Oct 2025Under review - Rhizosphere
Stem rot affects the structure of rhizosphere microbiome in berseem clover (Trifolium alexandrinum)Rhizosphere, Apr 2025 -
BiomedBank: A Large-scale, Multimodal Data Ecosystem for Advancing Biomedical AIApr 2025In preparation