publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. ICML
    OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
    Shaobo Wang*, Xuan Ouyang*, Tianyi Xu*, Yuzheng Hu, Jialin Liu, Guo Chen, Tianyu Zhang, Junhao Zheng, Kexin Yang, Xingzhang Ren, Dayiheng Liu, and Linfeng Zhang
    In ICML, 2026
    Under review
  2. ACL
    SITA: Learning Speaker-Invariant and Tone-Aware Speech Representations for Low-Resource Tonal Languages
    Tianyi Xu*, Xuan Ouyang*, Binwei Yao, Shoua Xiong, Sara Misurelli, Maichou Lor, and Junjie Hu
    In ACL, 2026
    Under review
  3. WACV
    Self-Supervised Sound Detection with AudioMAE for Robust, Label-Efficient Biodiversity Monitoring
    Tianyi Xu, Claudia Solís-Lemus, Daniel Pimentel-Alarcon, and Zuzana Burivalova
    In CV4EO Workshop, WACV, 2026
    Under review

2025

  1. Soil Use Manag.
    network.png
    Combined effects of methyl bromide and soil amendments on soil bacterial and fungal communities in turfgrass
    Tianyi Xu*, Salma Mukhtar*, Evan Gorstein, Claudia Solis-Lemus, Ming Yi Chou, and Paul Koch
    Soil Use and Management, Oct 2025
    Under review
  2. Rhizosphere
    rhizosphere.png
    Stem rot affects the structure of rhizosphere microbiome in berseem clover (Trifolium alexandrinum)
    Salma Mukhtar, Zain Ahmad, Noor Khan, Tianyi Xu, and Dalaq Aiysha
    Rhizosphere, Apr 2025
  3. biomedbank.png
    BiomedBank: A Large-scale, Multimodal Data Ecosystem for Advancing Biomedical AI
    TBD
    Apr 2025
    In preparation