Prompting Enables Post-Launch AI Expansion for Spacecraft Inspection

Nicholas A. Welsh, Lennon J. Shikhman, Monty Nehru Attazs, Seemanthini K. Putane, Van Minh Nguyen, Ryan T. White· June 16, 2026 View original

Summary

This research explores using prompt-driven vision-language models to enable post-launch semantic expansion for spaceborne inspection systems. It demonstrates that new spacecraft components can be specified via natural language prompts without modifying onboard model weights, achieving significant instance segmentation performance on previously unseen satellites.

Spaceborne inspection systems typically deploy perception models before launch, making subsequent updates to model weights or expansion of fixed label sets operationally challenging. This paper investigates a novel approach: using prompt-driven vision-language models to enable post-launch semantic expansion. The goal is to allow new spacecraft components to be identified and specified through natural language prompts, eliminating the need to modify or re-upload onboard model parameters. The study evaluates the zero-shot instance segmentation capabilities of a model (SAM3) on a test set of 129 images of previously unseen satellites, adhering to a strictly frozen, single-pass inference protocol. Under fixed global thresholds and without post-processing, SAM3 achieved a mean Average Precision (mAP) of 0.385 at an Intersection over Union (IoU) threshold of 0.5, and 0.267 across IoU thresholds from 0.5 to 0.95. Performance was found to be strongly dependent on the scale of the objects; large structural elements like spacecraft bodies and solar arrays were reliably localized, while smaller appendages such as antennas and thrusters remained more difficult. Crucially, prompt formulation significantly influenced performance, with structured prompts incorporating spatial and geometric descriptors yielding up to an 82% improvement over simple category-name prompts. The model operates within the memory and compute constraints of contemporary embedded GPUs, suggesting that prompt-driven grounding offers a practical mechanism for extending the semantic capabilities of spacecraft inspection systems post-launch, while also highlighting current limitations for fine-scale component localization under orbital domain shift.

Why it matters

For aerospace engineers, satellite operators, and AI developers in constrained environments, this research offers a groundbreaking method for dynamically updating AI capabilities without costly and complex redeployments. It enables greater adaptability and longevity for autonomous systems operating in remote or inaccessible locations.

How to implement this in your domain

  1. 1Explore integrating prompt-driven vision-language models into perception systems for remote or embedded applications where model updates are difficult.
  2. 2Design structured natural language prompts that incorporate spatial and geometric descriptors for improved object detection and segmentation.
  3. 3Develop strategies for fine-tuning prompt engineering to maximize performance for specific object types or environmental conditions.
  4. 4Evaluate the computational and memory footprint of such models to ensure compatibility with embedded hardware constraints.
  5. 5Consider this approach for expanding the capabilities of deployed autonomous systems in domains like infrastructure inspection, defense, or remote sensing.

Who benefits

AerospaceSatellite OperationsRoboticsDefenseRemote Sensing

Key takeaways

  • Post-launch AI capability expansion is challenging for spaceborne systems.
  • Prompt-driven vision-language models enable semantic expansion without weight updates.
  • Zero-shot instance segmentation shows promise for spacecraft components.
  • Structured prompt formulation significantly improves detection performance.

Original post by Nicholas A. Welsh, Lennon J. Shikhman, Monty Nehru Attazs, Seemanthini K. Putane, Van Minh Nguyen, Ryan T. White

"arXiv:2606.15427v1 Announce Type: new Abstract: Spaceborne inspection systems often deploy perception models prior to launch, after which updating model weights or expanding fixed label sets becomes operationally impractical. While supervised models can be integrated pre-flight,…"

View on X

Originally posted by Nicholas A. Welsh, Lennon J. Shikhman, Monty Nehru Attazs, Seemanthini K. Putane, Van Minh Nguyen, Ryan T. White on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses