extreme1228

Follow

Lv Bowen extreme1228

Follow

PhD candidate at Tsinghua University

9 followers · 6 following

Tsinghua University
Beijing

Achievements

Achievements

Highlights

Pro

extreme1228/README.md

👋 Hi, I'm Bowen Lv

Ph.D. Student @ Tsinghua University (Since 2025)

🎓 Education

Ph.D. in Computer Science, Tsinghua University (2025–)
B.Eng. in Computer Science, Tongji University, 2021–2025

🔬 Research Interests

Large Language Model (LLM) post-training, Reinforcement Learning (RL), and GUI / Computer-Use Agents.

💡 Current Research

ScaleCUA — scaling computer-use agents with verifiable task synthesis and efficient online RL (open-source SOTA on OSWorld / ScienceBoard).
Agentic RL systems — asynchronous, train–inference-decoupled online RL for multi-task, multi-turn agents.
One year at Zhipu AI on large-model post-training and GUI Agent RL.

📝 Publications

Bowen Lv, Zehan Qi ScaleCUA: Scaling Computer Use Agents with Verifiable Task Synthesis and Efficient Online RL (first author) Preprint, 2026. — 68.7% OSWorld / 54.0% ScienceBoard (open-source SOTA)
Xueqiao Sun, Xiao Liu, Bowen Lv, et al. KARL: Reinforcement Learning for LLM Agents on Multi-Turn Knowledge-Intensive Agentic Tasks ACL 2026 main conference.
Wenyi Hong, Xiaotao Gu, ... , Bowen Lv, ... , GLM-V Team GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Preprint, 2026.
[arxiv]
Hanchen Zhang, Xiao Liu, Bowen Lv, et al.
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework.
Preprint, 2025.
[arxiv]
Di Zhang, Bowen Lv, et al.
Focus on What Matters: Separated Models for Visual-Based RL Generalization.
NeurIPS 2024.
[arxiv]

📝 Blogs

🔗 Links

📫 Contact 📧 1486404293@qq.com / lvbowen1228@gmail.com 🔗 GitHub Profile
📄 Google Scholar

Popular repositories Loading

CS-IN-TJ CS-IN-TJ Public

同济大学计算机系学习资料，包含课程笔记、代码和项目，欢迎参考与贡献！

Jupyter Notebook 7
ACM-template ACM-template Public

C++ 1
ocean_compute_new ocean_compute_new Public

JavaScript 1
extreme1228 extreme1228 Public

HTML 1
deep-learning-hw deep-learning-hw Public

Jupyter Notebook
FileSystem FileSystem Public

C++