Projects

馃 Research Projects

馃敼 LLM Collaboration With Multi-Agent Reinforcement Learning

Keywords: Dec-POMDP 路 CTDE-inspired training 路 MAGRPO 路 Multi-turn LLM collaboration

Formalized multi-LLM collaboration as a cooperative Dec-POMDP: each agent observes partial, prompt-based information and outputs natural-language actions; the system/reward model provides joint rewards over multi-turn interactions

Introduced MAGRPO, extending GRPO to multi-agent, multi-turn settings by sampling groups of joint rollouts and using group-relative Monte-Carlo advantages to coordinate agents without training a large centralized value model

Evaluated on two collaboration domains:

Writing: 2 Qwen agents learn complementary roles (concise TLDR + detailed summary; background + method/experiments for arXiv-style expansion), improving structure/style coherence vs. prompt-level baselines

Coding: 2 Qwen coder agents generate a helper + main function; rewards include structure/syntax/tests and an explicit cooperation bonus when main correctly uses helper; introduced CoopHumanEval to reduce non-cooperative noise in HumanEval

Empirically, MAGRPO outperforms single-model and prompt-only multi-agent baselines in overall return and cooperation metrics, especially in multi-turn coding with external feedback signals


馃敼 YUSE: Symbolic Testing for Floating-Point Programs

Keywords: Symbolic Execution 路 Floating-Point Errors 路 Software Testing

  • Developed techniques for detecting floating-point inconsistencies and corner-case bugs
  • Focused on numerical instability, path explosion, and solver scalability
  • Contributed to modeling, testing, and evaluation components of the framework
  • This project strengthened my interest in robust and reliable systems

馃敼 Parallel Computing on RTEMS (Paor Project)

Keywords: Parallelism 路 Embedded Systems 路 Performance Optimization

  • Worked on system-level parallel computation in real-time operating systems
  • Designed and evaluated parallel strategies under strict timing constraints
  • Gained experience bridging systems research with algorithmic design

You may find more and smaller-scale projects or in-class notes I took at https://github.com/zedyelllion?tab=repositories.