Bingsheng Arthur Yao
Associate Research Scientist at Northeastern University
I am an associate research scientist in the Khoury College of Computer Sciences at Northeastern University (PI: Prof. Dakuo Wang). I received my PhD from Rensselaer Polytechnic Institute, advised by Prof. Jim Hendler.
I study how LLM agents can genuinely collaborate with humans through the lens of remote human collaboration. My research sits at the intersection of Human-Computer Interaction and Natural Language Processing, with a focus on designing the interaction patterns and methodologies for effective human-agent collaboration in practice, and on developing and evaluating LLM agents that can think and behave collaboratively.
Research
I. Human-Centered AI for Clinical Care
Healthcare workflows often break down at the seams between provider teams, patients, and caregivers. I investigate gaps in stakeholders’ communication and coordination across different clinical scenarios, and design, deploy, and study AI/LLM-powered multi-modal systems that work in those gaps to support the people involved. My work spans clinical decision-making, patient-provider communication, provider coordination, remote patient monitoring, and post-surgical care.
- (CSCW '26) RECOVER: LLM-Based Remote Monitoring System for Post-Surgical Cancer Care
- (CHI '26) Providers' Perspectives for Serious Illness Conversations in the ED
- (CHI '26) Collaboration Breakdowns Between Provider Teams and Patients in Post-Surgery Care
- (CHI '25) CardioAI: Multi-Modal AI Remote Monitoring System for Cancer Treatment-Induced Cardiotoxicity
- (IMWUT '24) Talk2Care: LLM-Based Conversational Assistant for Asynchronous Older Adults-Provider Communication
- (CHI '24) AI-Assisted Clinical Decision Making for Sepsis Diagnosis
II. Genuine Human-Agent Collaboration
I envision a near future where LLM agents can genuinely collaborate with us, not for us, like a remote human collaborator. To get there, I design interaction patterns and study methodologies for human-agent collaboration, and develop and benchmark LLM agents that can think and behave collaboratively. My work spans collaboration frameworks and design philosophy, trust and shared context in human-agent teams, evaluation methods for collaboration quality, agent oversight, and role-playing agents.
- (CHI '26) Configurable Research Platform for Controlled Human-Agent Collaboration Experiments
- (CHI '26) LLM Agent Susceptibility to Manipulative Interfaces
- (CHI EA '26) A Vision, Design Philosophy, and Empirical Framework for Human-Agent Collaboration
- (CHI EA '26) Agent A/B: Automated and Scalable A/B Testing Simulation
- (Preprint, 2025) Dynamic Persona Refinement Framework for Optimizing Behavioral and Cognitive Alignment
- (ACL Findings '25) Survey of LLM Role-Playing Agent Evaluation
Note
Please refer to my Google Scholar page for the most up-to-date publication record.
The best way to reach me is by email: b [dot] yao [at] northeastern [dot] edu.
News
| Date | News |
|---|---|
| 2026.03 | Five papers (two first-authored) and one workshop were accepted to CHI 2026! |
| 2026.02 | Two papers (one first-authored) were accepted to CSCW 2026! |
| 2025.11 | I received a Microsoft Agentic AI Research and Innovation (AARI) award to support my research on human-agent collaboration. |
| 2025.02 | A co-first-authored survey paper on LLM Role-Playing Agent Evaluation is now publicly available. |
| 2025.01 | Five papers were accepted to CHI 2025! |
| 2024.10 | Our paper StorySparkQA Dataset with Real-World Knowledge for Children Education was accepted to EMNLP 2024. |
| 2024.09 | Our paper Secret Use of Large Language Models was accepted to CSCW 2025. |
| 2024.07 | Our paper Early Sepsis Prediction with Uncertainty Quantification and Active Sensing was accepted to KDD 2024. |
| 2024.04 | Our papers LLM-based Voice Assistant for Asynchronous Older Adults-Care Provider Communication and Mental-LLM were accepted to IMWUT 2024. |
| 2024.03 | My first-authored paper In-Context Sampling Strategy for Reliable LLM Prompting was accepted to NAACL 2024 Findings. |