Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2
Mirrored from NVIDIA Developer Blog for archival readability. Support the source by reading on the original site.
Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2
Release features include one-command deployment of NVIDIA NemoClaw, NVIDIA agent skills for Jetson, official Yocto Project support, Super Mode on Jetson AGX Orin 32 GB
AI-Generated Summary
- NVIDIA JetPack 7.2 enhances Jetson devices with one-command deployment of NemoClaw, enabling easy development of secure, agentic AI applications for robotics, industrial automation, and edge AI systems.
- The update includes Jetson agent skills that automate development tasks like Linux customization, memory optimization, and model benchmarking, accelerating time to market and reducing complexity.
- JetPack 7.2 introduces Multi-Instance GPU (MIG) support on Jetson Thor for isolated, predictable multiworkload execution, critical for real-time AI applications in robotics and industrial automation.
AI-generated content may summarize information incompletely. Verify important information. Learn more
As AI agents move from the digital world to the physical environment, they can readily use NVIDIA Jetson to accelerate real-world deployment with optimized memory and performance.
NVIDIA JetPack 7.2 directly supports one-command deployment of NVIDIA NemoClaw, an open source stack that adds privacy and security controls to OpenClaw. It introduces NVIDIA agent skills for Jetson—Jetson device-side skills and Jetson BSP skills—and extends the latest compute stack and agentic capabilities to NVIDIA Jetson Orin. The Jetson software-defined platform makes this possible: the same hardware continues to deliver more value with every software release.
This post introduces new JetPack 7.2 release features and capabilities, which also include:
- NVIDIA Multi-Instance GPU (MIG) support on NVIDIA Jetson Thor for deterministic multiworkload execution
- Official Yocto Project support for custom Linux distributions that can further improve system efficiency
- Super Mode for Jetson AGX Orin 32 GB for higher AI performance and greater cost efficiency at the edge
Together, these updates help developers get more out of existing Jetson hardware, accelerate time to market, and lower total cost of ownership.
How is NVIDIA JetPack 7.2 software agentic-ready?
With JetPack 7.2, Jetson is NemoClaw-ready out of the box. JetPack 7.2 comes preconfigured with the required dependencies and software stack, so you can deploy and run NemoClaw-based workflows on Jetson without manual environment setup. This enables you to easily build agentic physical AI applications across robotics, industrial automation, vision agents, and edge AI systems.
To install NemoClaw on a Jetson device running JetPack 7.2, run the following single command:
curl -fsSL nvidia.com/nemoclaw.sh | bash
NVIDIA agent skills for Jetson in JetPack 7.2
JetPack 7.2 also provides developers with Jetson agent skills to build and optimize Jetson software stacks using AI agents. The agent skills are a set of repeatable, agent-executable instructions that define which tools to call, what outputs to produce, and how to validate results. Rather than manually configuring each step of the development process, developers can leverage agent skills through an agent to handle these tasks automatically.
Jetson agent skills apply this pattern specifically to Jetson software development workflows. These agent-driven workflows help automate common development tasks such as Jetson Linux customization, memory optimization, model benchmarking, and deployment configuration. With both device-side and BSP-side implementations, developers can use agent skills to reduce development complexity and accelerate the path from prototyping to production deployment on Jetson platforms.
JetPack 7.2 ships three categories of skills:
- Jetson Linux customization skills: Guide an agent to build and customize a BSP from scratch for custom carrier boards. This includes configuring I/Os, clock settings, fan control, power profiles, or any other module for a specific hardware design. Tasks that previously required weeks of manual effort can be handled by an agent, reducing time to market for custom Jetson designs.
- Memory optimization skills: Optimize memory usage across the software stack. These skills can tune the whole stack starting bootloader memory carveouts, optimize kernel memory reservation, reduce redundant user space processes, and help build the most memory-efficient software configuration for a given workload. This directly reduces TCO by enabling more capable workloads to run on lower memory configurations.
- Model benchmarking skills: Help you identify the best model configuration for your use case. These skills cover model benchmarking, inference optimization, and Jetson diagnostics. For example, a developer building a NemoClaw-based application can use these skills to determine which model runs most efficiently on their target device for their specific task.
Along with these three categories of skills, NVIDIA is also introducing skills that help agents build vision pipelines using NVIDIA DeepStream and NVIDIA Metropolis Blueprint for Video Search and Summarization (VSS).
To learn more and get started, check out Jetson device-side skills and Jetson BSP skills on GitHub.
MIG on Jetson Thor enables GPU partitioning for mixed-criticality workloads
JetPack 7.2 on Jetson Thor introduces support for MIG, allowing the integrated NVIDIA Blackwell GPU to be partitioned into two isolated GPU instances with dedicated compute, cache, and memory bandwidth. This enables multiple AI workloads to run concurrently with predictable performance and minimal interference.
Combined with the Preemptible RT kernel in JetPack 7, MIG helps create a more deterministic execution environment for mixed-criticality systems. Workload determinism is critical for physical AI systems such as humanoid robots, autonomous machines, industrial automation, and medical devices. This is because perception, planning, control, generative AI, and safety workloads often share a single SoC, where resource contention can introduce latency jitter into time-sensitive pipelines.
With MIG on Jetson Thor, developers can dedicate GPU resources to latency-sensitive robotics workloads while running best-effort AI inference or generative AI models on a separate partition. This helps maintain predictable latency and quality of service for workloads such as perception, sensor fusion, motion planning, and safety monitoring. JetPack 7.2 supports two MIG partitions on Jetson Thor:
- A larger AI and graphics partition for inferencing, rendering, visualization, and general NVIDIA CUDA workloads (12 SMs, 1536 CUDA cores)
- A second isolated compute partition for robotics, control, perception, or safety-critical workloads (8 SMs, 1024 CUDA cores)
Applications, containers, and services can be assigned to specific MIG partitions using standard CUDA Runtime controls and NVIDIA Container Toolkit integration. This is especially important for next-generation humanoid robotics running multiple AI pipelines across different timing domains, where control loops, AI perception, and generative AI reasoning must reliably coexist on a single embedded platform.
By bringing data-center-class GPU partitioning to embedded AI computing, JetPack 7.2 enables more capable edge AI systems with improved predictability and reliability for real-world deployment. Read more about MIG on Jetson Thor.
Introducing Yocto Project support on NVIDIA Jetson
Starting with JetPack 7.2, NVIDIA provides official Yocto Project support on Jetson, including validated recipes and reference images for Jetson developer kits. The Yocto Project is an open source Linux Foundation project that provides tools to build custom Linux distributions for embedded hardware architectures.
NVIDIA now leads roadmap contributions with a regular release cadence to the OE4T layer. NVIDIA owns the CI/CD pipeline, SQA, and releases validated reference images for Jetson developer kits. And developers have access to technical documentation and dedicated forums support.
The Yocto Project brings three core benefits to Jetson developers:
- Customizability: Enables you to build tightly tailored images that include only the required services, drivers, and libraries, rather than adapting the NVIDIA Ubuntu L4T image. This reduces memory footprint and optimizes system performance for the target application.
- Reproducibility: Yocto Project produces identical image builds across runs, simplifying debugging, testing, and certification workflows. This is especially valuable in regulated fields such as medical and industrial deployments.
- Open ecosystem. Access thousands of recipes and community layers for AI frameworks, industrial protocols, and custom middleware.
To help you decide when to use L4T/JetPack versus OE4T/Yocto Project, refer to the Developer Decision Guide in Figure 3.
With the official support of Yocto Project on Jetson, NVIDIA has also built a robust ecosystem of distribution partners, ISVs, and ODMs to accelerate and simplify Yocto Project development on Jetson platforms. These partners provide a range of offerings including production-ready Linux distributions, BSP customization, long-term support, fleet management solutions, multimedia and ISP expertise, and security-focused integrations.
Companies such as Konsulko Group and Peridio offer complete OS solutions like Konsulko Orca OS and Avocado OS, while Balena focuses on container-based fleet management and deployment at scale. Other NVIDIA partners include Neurealm, RidgeRun, and Wind River, who provide extensive engineering and NRE services with deep expertise in embedded Linux, BSP customization, multimedia pipelines, and long-term platform support. Together, this ecosystem enables developers to rapidly deploy, customize, and scale Yocto-based solutions on Jetson.
In addition to distribution partners and ISVs, NVIDIA also works closely with a strong ecosystem of partners to help customers accelerate product development and deployment on Jetson platforms such as AAEON, Advantech, Antmicro, ASUS, AVerMedia, Connect Tech, EDOM, and YUAN provide a wide range of hardware solutions including carrier boards, edge AI systems, industrial embedded platforms, video capture solutions, and reference designs optimized for Jetson. These partners enable developers to rapidly prototype and scale production-ready AI and edge computing solutions with hardware platforms tailored for robotics, industrial automation, smart cities, healthcare, retail, and other embedded AI applications.
Unifying the Jetson stack and unlocking more performance
JetPack 7.2 extends the Ubuntu 24.04, kernel 6.8 and CUDA Toolkit 13.0-based compute stack (introduced with Jetson Thor) to the Jetson Orin family, bringing both platforms onto a single unified software foundation. With a common stack across Orin and Thor, you can seamlessly deploy the latest AI applications across the entire Jetson portfolio while taking advantage of the newest CUDA capabilities, libraries, and performance optimizations.
This unified approach significantly reduces the engineering effort required to support multiple hardware platforms, simplifying application development, validation, deployment, and long-term fleet maintenance.
JetPack 7.2 also introduces a new Super Mode for Jetson AGX Orin 32 GB, unlocking higher GPU and power configurations that bring its performance much closer to Jetson AGX Orin 64 GB. By increasing GPU frequencies from 930 MHz to 1.3 GHz and enabling higher power envelopes up to 60W, Super Mode boosts AI performance from 200 TOPS to 241 TOPS, a more than 20% increase over the standard AGX Orin 32 GB configuration.
This enhancement enables customers to achieve near-flagship AGX Orin 64 GB performance using Jetson AGX Orin 32 GB, while reducing module cost by 45%. The new Super Mode makes the 32 GB module a cost-effective choice for generative AI, robotics, and edge AI deployments.
| Jetson AGX Orin 32 GB | Jetson AGX Orin 32GB Super | Jetson AGX Orin 64 GB | |
| Nemotron3 Nano 30B A3B | 31 | 37 | 40 |
| Cosmos Reason 2 8B | 9 | 10 | 10 |
| Qwen 3.5 4B | 24 | 27 | 28 |
| Qwen 3.5 9B | 13 | 15 | 17 |
| Qwen 3.6 27B | 4 | 5 | 7 |
| Gemma 4 E4B | 25 | 29 | 32 |
Get started with NVIDIA JetPack 7.2
JetPack 7.2 delivers more value from the same Jetson hardware through software. As agentic AI moves to the edge and memory costs remain a real constraint in production deployments, this release directly addresses both.
Features include one-command deployment of NVIDIA NemoClaw, memory and workflow optimization agent skills for Jetson, official Yocto Project support for lean and reproducible production builds, and MIG on Jetson Thor for deterministic multiworkload execution. With JetPack 7.2, you can do more on existing hardware while building toward increasingly capable agentic workloads at the edge.
Download JetPack 7.2 to get started deploying agentic AI at the edge. For questions and community support, visit the NVIDIA Developer Forum.
Join NVIDIA founder and CEO Jensen Huang for the NVIDIA GTC Taipei 2026 Keynote and learn more with related sessions.
Tags
About the Authors
Peilun Tsai is a senior product marketing manager for Edge AI at NVIDIA. With experience spanning GTM strategy, product management, and sales in edge AI, enterprise software, and vision AI, she works to connect technical innovation market adoption. Peilun holds an MBA from Duke University.
Shashank Maheshwari is the product manager for Jetson and IGX software at NVIDIA. He holds an MBA from Duke University and a bachelor’s degree in Electronics from BITS Pilani.
Comments
More from NVIDIA Developer Blog
-
Run Local AI Agents with Faster Models and Multi-Node Clustering on NVIDIA DGX Spark
Jun 1
-
How to Post-Train Autonomous Vehicle Models in Closed-Loop with NVIDIA Alpamayo
Jun 1
-
Develop Physical AI Reasoning, World, and Action Models with NVIDIA Cosmos 3
Jun 1
-
Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security
Jun 1
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.