Welcome!

I am currently a forth-year undergraduate in the School of Software Engineering at Tongji University, and also a visiting student at the University of California, Berkeley since January 2024. My research focuses on data mining, multimodal learning, and natural language processing, especially exploring how to enable vision-language models (VLMs) to perceive the world, reason, and respond to their environment in a human-like manner, grounded in a multimodal context.

Previously, I have interned as a research assistant in the NaMI lab/ADE Lab at Tongji University, MIT Media Lab’s City Science Lab@Shanghai, and the Department of Computer Science at the University of Hong Kong. Currently, I am interning at Berkeley NLP Group within Berkeley Artificial Intelligence Research (BAIR) Lab, working with Prof. Alane and Dr. Zineng to explore the future of vision language models (VLMs).

You can find my CV here: Lingjun Mao’s Curriculum Vitae. If you are interested in my work, please feel free to drop me an email.

I am currently seeking PhD opportunities for Fall 2025. If you have a suitable opportunity, please feel free to contact me.

🔥 News

  • 2024.09:  Our work on multi-perspective communication has been accepted by EMNLP main 2024.
  • 2024.09:  Our work on multimodal instruction-tuning for biomedicine is accepted by NeurIPS D&B 2024!
  • 2024.06:  🎉🎉Our paper “Among Agents” is accepted at ACL Wordplay Workshop 2024. See you in Bangkok!
  • 2024.01:  Thrilled to join the Berkeley NLP Group as an intern! Go bears!
  • 2023.11:  Accepted into the Berkeley Global Access program. Looking forward to California!🐻🔥
  • 2023.07:  Accepted into the University of Hong Kong’s CS summer research internship. A wonderful summer with Prof. Chuan Wu and Dr. Junwei Su!
  • 2022.02:  Join the MIT Media Lab’s CSL@Shanghai.

✍️ Ongoing Projects

Notice: All content is for reference only. Research directions and team members are all subject to change!

sym

VLMs Are Blind! They are Just Listenser Instead of Obsearver

Lingjun Mao, Zineng Tang, Alane Suhr

Project | Comming Soon

  • Content to be released soon.
sym

Dynamic Tokenization: Not Every Image Needs 768 Tokens

Zineng Tang*, Lingjun Mao*, Rudy Corona*

Project | Comming Soon

  • Content to be released soon.
sym

Towards Real-time Interactive Video Language Models

Jessy Lin, Jiayi Pan, Zineng Tang, Lingjun Mao

Project | Comming Soon

  • Content to be released soon.

📝 Publications

sym

How Do Color Illusions Affect The Perception of Vision-language Models?

Lingjun Mao, Zineng Tang, Alane Suhr

Project | Submitted to CVPR 2025

  • We propose an automated framework for generating realistic illusion images and creat a large, realistic dataset of color illusion images.
  • We investigate the underlying mechanisms of color illusions.
sym

Grounding Language in Multi-Perspective Referential Communication

Zineng Tang, Lingjun Mao, Alane Suhr

Project | EMNLP main 2024

  • We introduce a task and dataset for referring expression generation and comprehension in multi-agent embodied environments.
sym

AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game

Yizhou Chi, Lingjun Mao, Zineng Tang

Project | ACL Wordplay Workshop 2024

  • This paper focuses on creating proxies of human behavior in simulated environments, with “Among Us” utilized as a tool for studying simulated human behavior.
sym

Biomedical Visual Instruction Tuning with Clinician Preference Alignment

Hejie Cui*, Lingjun Mao*, Xin Liang, Jieyu Zhang, Hui Ren, Quanzheng Li, Xiang Li, Carl Yang

Project | NeurIPS D&B 2024

  • we propose a data-centric framework (Biomed-VITAl) that incorporates clinician preferences into both stages of generating and selecting instruction data for tuning biomedical multimodal foundation models.
sym

BG-HGNN: Toward Scalable and Efficient Heterogeneous Graph Neural Network

Junwei Su*, Lingjun Mao*, Chuan Wu

Project | Submitted to AAAI 2025

  • We first highlights and demonstrates that the standard approach employed by existing HGNNs inevitably leads to parameter explosion and relation collapse.
sym

AI Agent as Urban Planner: Steering Stakeholder Dynamics in Urban Planning via Consensus-based Multi-Agent Reinforcement Learning

Kejiang Qian, Lingjun Mao, Xin Liang, Yimin Ding, Jin Gao, Xinran Wei, Ziyi Guo, Jiajie Li

Project | Submitted to AAAI 2025

  • we introduce a Consensus-based Multi-Agent Reinforcement Learning framework for real-world land use readjustment.

🎖 Honors and Awards

  • 2024 Berkeley Global Access Scholarship (10 in all)
  • 2023 Xiangcheng High-Tech Scholarship
  • 2023 Xiangcheng High-Tech Scholarship
  • 2023 China Collegiate Computing Contest (CCCC) China Second Prize
  • 2023 China Collegiate Computing Contest (CCCC) Shanghai First Prize
  • 2023 Shanghai Innovation and Entrepreneurship Project Award
  • 2023 ETH Beijing Hackathon finalist
  • 2022 National-level Innovation and Entrepreneurship Project Award
  • 2022 Second prize in the Gobang AI algorithm competition of Tongji University
  • 2022 Excellent backbone of Tongji University Student Union
  • 2022 Outstanding Students of Tongji University (5%)
  • 2022/2023 Third Prize of Asia Pacific Cup Mathematical Modeling
  • 2022 Winner of “Crestron” Circular Economy Competition
  • 2021 Third Prize in Mathematical Modeling, Tongji University
  • 2021/2022 Tongji University Scholarship for Outstanding Undergraduate Students (5%)

📖 Educations

  • 2024.01 - 2024.10, Visiting Student (Berkeley Global Access Exchange Program) in University of California, Berkeley, USA
  • 2020-2025(expected), Software Engineering, Tongji University, Shanghai, China

💻 Internships

  • Feb 2024 - Present: Berkeley NLP Group, Berkeley Artificial Intelligence Research (BAIR) Lab
  • Nov 2023 - Present: Department of Computer Science, Emory University
  • Apr 2023 - Present: Department of Computer Science, University of Hong Kong
  • Apr 2022 - Nov 2023: City Science Lab@Shanghai (MIT Media Lab)
  • Sept 2021 - Nov 2023: Tongji ADE Lab
  • May 2021 - Apr 2023: Tongji NaMI Lab

📚 Projects

sym

Equivariant Neural Networks on Discrete Symmetry Groups

Krishnakumar Bhattaram, Lingjun Mao, Markian Rybchuk

Project | Research Project

  • we introduce an equivariant approach targeting specific discrete symmetry groups with the hypothesis that restricting the function space of learned equivariant functions to the eigenbasis of the Hamiltonian will provide a useful inductive bias and reduced data complexity.
sym

A distributed neural network computation acceleration method based on meta-representation

Kaiyu Huang, Lingjun Mao, Qingjiang Shi

Project | Research Project

  • We replaced the last fully connected layer of the traditional neural network with coded quantized matrix multiplication and compared it with the speed and accuracy of the traditional neural network.
sym

Rethinking of Generalization in Dynamic System

Junwei Su, Lingjun Mao, Mengfan Liu, Chuan Wu

Project | Research Project

  • We point out the flaws in current evaluation methods, which don’t fully consider the dynamic and time-sensitive nature of these networks.
sym

JourneyCam: VR-assisted photography teaching APP

Lingjun Mao, Xin Liang, Manxin Xu

Project | CCCC 2023 Award-Winning Work

  • JourneyCam is a VR-assisted photography teaching app whose core function is to provide tutorials on professional photography with cell phones.
sym

FactLENS DAO -A Decentralized News Validation Ecosystem

Lingjun Mao, Xin Liang, Chance Jiajie Li, Nina Wang, Yongqi Li

Project | ETH Beijing Hackathon Award-Winning Work

  • FactLENS is a decentralized news validation ecosystem, which consists of FactLENS plugin and FactLENS website.
sym

SoCity DAO - Green Commute

Ryan Zhang, Chance Li, Charlotte Ge, Kejiang Qian, Lingjun Mao, Xin Liang, Chengliang Li

Project | non-profit organization

  • SoCity is a non-profit organization to promote prosocial behaviors with decentralized incentive policies. It is conducted by City Science group (MIT Media Lab) and City Science Lab Shanghai (Tongji University).

2024@Lingjun Mao