[2025-11] InternManip-Eval: A universal, easy-to-use, and efficient framework for embodied-manipulation tasks across diverse benchmarks.

Note: To protect commercial confidentiality, public information is limited.

Project Overview

This is the clean, evaluation-only version extracted from the InternManip project, which was refactored by an individual.

A universal, easy-to-use, and efficient evaluation framework, designed for embodied manipulation tasks across various benchmarks. It currently supports the Calvin, SimplerEnv, and GenManip benchmarks, and also includes ARX LIFT2 real-robot control (from the IROS offline competition, though the prerequisite robot service launch code is not included).

🏠 Highlights

Unified
- Evaluations for different benchmarks are integrated into a single framework, and it is easy to extend with additional ones.
Easy to Use
- All evaluation settings are consolidated into one configuration file.
- One-command Python dependency installation.
- C-S(client–server) evaluation structure that decouples the agent from the environment, allowing each to use its own dependencies. Don’t be intimidated by the C-S setup——codebase supports automatic c–s evaluation launch, while also allowing manual launch (useful when deploying the agent on different nodes or when specifying a custom port for the agent server).
Efficient
- Supports distributed evaluation acceleration (and again, users don’t need to worry about the underlying implementation).

Framework

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Jiantong Chen

Project Overview

🏠 Highlights

Framework

Share on