Minh Duc (David) Chu

PhD candidate at the USC Information Sciences Institute, advised by Luca Luceri (SIGNALS Lab) and Kristina Lerman.

Incoming Anthropic AI Safety Fellow (Summer 2026), focusing on work that advances AI alignment and safety.

From Vũng Tàu, Việt Nam 🇻🇳. Based in Los Angeles, CA 🇺🇸.

Minh Duc (David) Chu
Scroll

About

I'm a Computer Science PhD candidate at the USC Information Sciences Institute, advised by Luca Luceri (SIGNALS Lab) and Kristina Lerman. This summer I'm an incoming Anthropic AI Safety Fellow, working on research that advances AI alignment and safety.

I grew up in Vũng Tàu 🇻🇳, a coastal city in southern Việt Nam, and now live in Los Angeles 🇺🇸. Outside research I box, play tennis, and read more philosophy of mind than is probably healthy for an empiricist.

I'm a Computer Science PhD candidate at the USC Information Sciences Institute, advised by Luca Luceri (SIGNALS Lab) and Kristina Lerman (Luddy School of Informatics, Computing and Engineering, Indiana University).

My research sits at the intersection of AI safety and alignment and mental health. I focus on how repeated interactions with conversational AI can compound into harmful trajectories — emotional dependency, belief spirals, and eating-disorder reinforcement — especially for teens and other vulnerable users.

I translate these psychosocial risks into concrete rubrics, red-teaming harnesses, and post-training recipes (SFT, RLHF, preference optimization) in partnership with clinicians and social scientists. My work bridges computational and clinical communities, with publications at venues like NAACL and EMNLP as well as clinical journals such as Body Image and the International Journal of Eating Disorders.

I'm an incoming Anthropic Fellow (AI Safety).

Research

I work on AI alignment and safety, with a current focus on the shift from AI-as-assistant to AI-as-companion.

AI Alignment & SafetySocio-technical AlignmentHuman–AI CompanionshipModel PsychologyModel WelfareCharacter TrainingSocial NLP
  1. 01

    Assistant → Companion

    What changes about safety when people stop using LLMs and start confiding in them.

  2. 02

    Model Psychology & Welfare

    Stable traits, drives, failure modes — and what we may owe entities trained to feel like someone.

  3. 03

    Character Training

    How voice, values, and refusals get baked in at scale.

  4. 04

    Aligning to Communities

    Tuning LLMs to specific online communities without flattening their language or norms.

  5. 05

    Computational Social Science

    Surfacing harm patterns — body image, eating disorders — across Twitter, Reddit, TikTok.

  6. 06

    LLM-Agent Info Ops

    Emergent coordinated behaviour in networked LLM agents and its strategic dynamics.

Publications

Google Scholar
  1. 01
    2026

    A Multimodal TikTok Dataset of Ecuador's 2024 Political Crisis and Organized Crime Discourse

    Charles Bickham, Bryan Ramirez-Gonzalez, Minh Duc Chu, Kristina Lerman, Emilio Ferrara

    ICWSM 2026

Contact

Happy to hear from anyone working on AI alignment, model welfare, or social NLP — and from anyone in Los Angeles looking for a tennis partner.