Posts by Collection

portfolio

Road Detection and Autonomous Driving

发布时间:

Research and development on road detection and autonomous driving systems, focusing on computer vision and perception for autonomous vehicles.

Robot Reinforcement Learning

发布时间:

Research on reinforcement learning for robotic control and manipulation, developing intelligent agents for robotic tasks.

Text-to-Scene Generation

发布时间:

Generating 3D scenes from natural language descriptions, bridging the gap between language understanding and 3D scene representation.

openRLHF and lightRLHF Framework Improvements

发布时间:

Algorithm improvements and enhancements for the openRLHF reinforcement learning framework, including lightRLHF - a lightweight version based on openRLHF improvements.

veRL Framework Improvements

发布时间:

Framework integration and improvements for veRL (Volcano Engine Reinforcement Learning), a flexible, efficient and production-ready RL training library for large language models.

Research MCP Servers

发布时间:

A collection of Model Context Protocol (MCP) servers for research workflows, including arXiv integration and other research tools.

Multi-Agent Research Assistant

发布时间:

An intelligent multi-agent system for research assistance, enabling collaborative problem-solving among multiple AI agents.

publications

Paper Title Number 4

Published in GitHub Journal of Bugs, 2024

This paper is about fixing template issue #693.

Recommended citation: Your Name, You. (2024). "Paper Title Number 3." GitHub Journal of Bugs. 1(3).
Download Paper

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law

Published in arXiv, 2025

We introduce SafeWork-R1, a cutting-edge multimodal reasoning model that demonstrates the coevolution of capabilities and safety. It is developed by our proposed SafeLadder framework, which incorporates large-scale, progressive, safety-oriented reinforcement learning post-training, supported by a suite of multi-principled verifiers.

Recommended citation: Shanghai AI Lab et al. (2025). "SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law." arXiv preprint arXiv:2507.18576.
Download Paper

talks

Talk 1 on Relevant Topic in Your Field

发布时间:

This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.