Tianyi Xu
CDIS, University of Wisconsin–Madison
Madison, WI 53706
I’m Tianyi Xu, a senior at the University of Wisconsin–Madison (B.S. in Computer Science, Data Science, and Mathematics; expected Dec 2025). I am advised by Prof. Junjie Hu and Prof. Claudia Solís-Lemus at UW–Madison. I work closely with Shaobo Wang and Prof. Linfeng Zhang at Shanghai Jiao Tong University. I’ve previously collaborated with Prof. Pedro Morgado and Prof. Chaowei Xiao.
I’m interested in building modern AI systems that are efficient, generalizable, and capable of understanding and acting across multiple modalities, especially under limited supervision and real-world constraints. Concretely, my work focuses on:
-
Data-centric & label-efficient learning — Designing methods for data selection, mixing, and self/weak supervision so that large-scale models can learn from noisy, heterogeneous data instead of only clean benchmarks.
-
Foundation models & agents— Building and steering pretrained models (LLMs, VLMs, etc.) for specific tasks, with an emphasis on efficiency, reliability, and agentic behaviors such as planning, reasoning, and safe decision-making.
-
Multimodal learning — Representation learning and building systems that perceive, act, and reason across different modalities.
-
AI for science and society — Applying these ideas to real-world problems where data is scarce or noisy: biodiversity monitoring from soundscapes, tone-aware speech modeling for accessibility, and multimodal toolkits/benchmarks for biomedical and clinical AI.
I’m seeking Ph.D. opportunities for Fall 2026.
selected publications
- WACVSelf-Supervised Sound Detection with AudioMAE for Robust, Label-Efficient Biodiversity MonitoringIn CV4EO Workshop, WACV, 2026Under review
- Environ. Microbiol.
Combined effects of methyl bromide and soil amendments on soil bacterial and fungal communities in turfgrassApplied Soil Ecology, Oct 2025Under review - Rhizosphere
Stem rot affects the structure of rhizosphere microbiome in berseem clover (Trifolium alexandrinum)Rhizosphere, Apr 2025 -
BiomedBank: A Large-scale, Multimodal Data Ecosystem for Advancing Biomedical AIApr 2025In preparation