CLOUDADV Optimizes Cloud VM Sizing with Zero-Shot AI Forecasting.

Jack Bell, Giacomo Carfi, Gerlando Gramaglia, Andrea Simioni, Daniele Fontani, Vincenzo Lomonaco· July 1, 2026 View original

Summary

CLOUDADV is an advisory system that uses zero-shot foundation models for time-series forecasting to help engineers right-size cloud virtual machines, significantly reducing costs and operational inefficiency even with workload changes. It provides decision-aligned recommendations by considering historical data, forecasts, pricing, and heuristics.

Cloud virtual machines are frequently overprovisioned, leading to unnecessary expenses and operational inefficiencies. A new interactive advisory system, CLOUDADV, aims to address this by assisting engineers in optimizing cloud instance sizing, even when workloads fluctuate. CLOUDADV leverages zero-shot time-series forecasting combined with bounded recommendation generation, offering planning horizons across days, weeks, and months. For each query, the system builds a comprehensive decision context, incorporating historical utilization, forecast summaries, current VM metadata, available instance options, pricing, and explicit sizing heuristics. The system uses a powerful LLM offline to generate reference recommendations, while a smaller production model is evaluated for deployment-time alignment, considering latency and cost. Case studies demonstrate substantial simulated monthly cost savings (over 50%) with minimal exceedance rates, indicating that zero-shot foundation models can effectively support decision-aligned provisioning in dynamic cloud environments, reducing the need for constant per-tenant retraining.

Why it matters

Cloud professionals can use this system to significantly cut infrastructure costs and improve efficiency by accurately sizing VMs, especially in dynamic environments where workloads drift.

How to implement this in your domain

  1. 1Evaluate current cloud VM utilization patterns to identify overprovisioning.
  2. 2Explore integrating zero-shot time-series forecasting models for workload prediction.
  3. 3Develop an advisory system that combines forecasts with pricing and instance options for sizing recommendations.
  4. 4Implement a feedback loop to monitor recommendation accuracy and cost savings.

Who benefits

Cloud ComputingIT ServicesE-commerceSaaSFinance

Key takeaways

  • CloudADV helps optimize VM sizing, reducing overprovisioning and costs.
  • It uses zero-shot foundation models for time-series forecasting under workload drift.
  • The system provides decision-aligned recommendations considering various factors.
  • Significant cost savings and reduced operational burden are achievable.

Original post by Jack Bell, Giacomo Carfi, Gerlando Gramaglia, Andrea Simioni, Daniele Fontani, Vincenzo Lomonaco

"arXiv:2606.31470v1 Announce Type: new Abstract: Cloud virtual machines are often overprovisioned, creating avoidable cost and operational inefficiency. We present CLOUDADV, an interactive engineer-facing advisory system for cloud instance sizing under workload drift. The system c…"

View on X

Originally posted by Jack Bell, Giacomo Carfi, Gerlando Gramaglia, Andrea Simioni, Daniele Fontani, Vincenzo Lomonaco on X · view source

Want to go deeper?

Turn these trends into skills with Learnijoy's hands-on AI & tech courses.

Explore courses

More in AI Engineering & DevTools

AI ResearchAI Engineering & DevTools

Philosophical Foundations for Explainable AI in Healthcare Explored

This paper critically reviews the intersection of philosophy of science and explainable AI (XAI) in health sciences, examining what constitutes an adequate medical explanation. It identifies causality, trust, and epistemic adequacy as central axes for designing robust XAI systems in clinical decision-making.

Martina Mattioli, Marcello PelilloJul 1, 2026
AI ResearchAI Engineering & DevTools

New Metric Improves LLM Reinforcement Learning with Verifiable Rewards.

This research introduces the Relative Surprisal Index (RSI), an information-theoretic metric for adaptive token selection in Reinforcement Learning with Verifiable Rewards (RLVR) for LLMs. RSI-S, an entropy-adaptive filtering method based on RSI, improves reasoning accuracy by 2-3 percentage points by retaining tokens within a stable surprisal interval.

Outongyi Lv, Yanzhao Zheng, Yuanwei Zhang, Zhenghao Huang, Xingjun Wang, Baohua Dong, Hangcheng Zhu, Yingda ChenJul 1, 2026
AI Engineering & DevToolsAI Research

New ACE Module Boosts LLM Agent Context Management

Researchers introduce ACE (Adaptive Context Elasticizer), a plug-and-play module that dynamically manages historical information for LLM-based agents. ACE maintains a lossless message layer and adaptively orchestrates context, significantly improving performance across various agent frameworks without architectural changes.

Ning Liao, Zihao Long, Xiaoxing Wang, Xue Yang, Yaoming Wang, Ziyuan Zhuang, Xunliang Cai, Rongxiang Weng, Junchi YanJul 1, 2026