New AI Agent Improves Web Automation by Reusing Interaction Patterns

Shiqi He, Yue Cui, Feijie Wu, Xinyu Ma, Jiaheng Lu, Yaliang Li, Bolin Ding, Mosharaf Chowdhury · June 17, 2026

Summary

SkillMigrator is an AI agent that learns reusable web skills and transfers them across websites by matching layout structure rather than specific element references. Each skill is stored as a "transferable interaction pattern" (TIP). The authors report 8-10% fewer LLM actions on successful trajectories at comparable success rates, which lowers the latency and cost of web automation tasks.

Large language model (LLM) web agents typically operate by observing a web page and then executing one low-level tool action per turn. This can lead to long action sequences and high computational costs, especially for complex tasks. To mitigate this, previous systems introduced "web skills": higher-level callable tools that encapsulate repeated interaction fragments. However, these skill libraries often see low reuse across websites because they trigger skills based on instruction similarity or coarse site metadata.

SkillMigrator is an agent designed to overcome this limitation. It learns reusable web skills and transfers them across sites by matching layout structure rather than specific element references. Each learned skill is stored as a "transferable interaction pattern" (TIP), which pairs the skill with a structural sketch of the web page at the time the skill was induced. During execution, SkillMigrator retrieves relevant TIPs based on layout similarity and then grounds their references onto the live web page.

On benchmarks such as WebArena and Mind2Web, this approach is reported to reduce the average number of LLM actions on successful trajectories by 8-10% while maintaining comparable success rates.
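To make the retrieval step concrete, here is a minimal sketch of layout-based matching. It is not the paper's implementation: the summary does not say how the structural sketch or the similarity score is computed, so this example assumes a sketch made of root-to-node tag paths and a weighted Jaccard score. The names `layout_sketch`, `layout_similarity`, `TIP`, and `retrieve` are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class _PathCollector(HTMLParser):
    """Counts root-to-node tag paths, ignoring ids, classes, and text."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.paths = Counter()

    def handle_starttag(self, tag, attrs):
        self.paths["/".join(self.stack + [tag])] += 1
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closing form such as <input/>: record the path, do not nest.
        self.paths["/".join(self.stack + [tag])] += 1

    def handle_endtag(self, tag):
        if tag in self.stack:
            # Pop up to and including the matching open tag.
            while self.stack and self.stack.pop() != tag:
                pass


def layout_sketch(html):
    """Assumed sketch format: a multiset of tag paths."""
    collector = _PathCollector()
    collector.feed(html)
    return collector.paths


def layout_similarity(a, b):
    """Weighted Jaccard similarity between two sketches, in [0, 1]."""
    keys = set(a) | set(b)
    union = sum(max(a[k], b[k]) for k in keys)
    return sum(min(a[k], b[k]) for k in keys) / union if union else 0.0


@dataclass
class TIP:
    name: str        # skill identifier (hypothetical field)
    sketch: Counter  # layout sketch captured when the skill was induced


def retrieve(tips, page_html, threshold=0.5):
    """Return (score, tip) pairs above the threshold, best match first."""
    live = layout_sketch(page_html)
    scored = [(layout_similarity(t.sketch, live), t) for t in tips]
    return sorted((s for s in scored if s[0] >= threshold),
                  key=lambda s: s[0], reverse=True)
```

For example, a login form induced on one site matches a login form on another site even though ids, classes, and labels differ, because only the tag structure enters the sketch:

```python
site_a = '<form id="login"><input name="user"><input name="pw"><button>Sign in</button></form>'
site_b = '<form class="auth"><input name="email"><input name="pass"><button>Log in</button></form>'
tips = [TIP("login", layout_sketch(site_a))]
matches = retrieve(tips, site_b)  # one match, the "login" TIP
```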

Why it matters

For teams building LLM web agents, this research points to a practical way to cut per-task cost. Because skills learned on one site can be reused on structurally similar sites, agents may need fewer model calls per task, and developers may spend less effort building site-specific skill libraries.

How to implement this in your domain

  1. Explore integrating layout-based skill transfer mechanisms into existing web automation frameworks.
  2. Develop or adapt tools to create "transferable interaction patterns" from successful user trajectories on web applications.
  3. Benchmark current LLM-based web agents against SkillMigrator's approach to identify potential efficiency gains.
  4. Train AI agents using these transferable patterns to improve performance on new or unseen web interfaces.
  5. Consider how this technology could enhance customer service chatbots or data extraction tools that interact with various web portals.
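One way to prototype the first two steps is to store skills with abstract references and bind them to concrete elements only at run time. The sketch below illustrates that grounding idea under stated assumptions: it identifies an element by its role and its position among elements of the same role, which is a simplification chosen for this example and not the method described in the paper. `SlotRef`, `Element`, and `ground` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class SlotRef:
    """Abstract reference stored in a skill instead of a site-specific id."""
    role: str   # e.g. "textbox" or "button"
    index: int  # 0-based position among elements with the same role


@dataclass(frozen=True)
class Element:
    """One element of the live page observation (assumed shape)."""
    element_id: str
    role: str
    label: str


def ground(slots: Dict[str, SlotRef], live: List[Element]) -> Optional[Dict[str, str]]:
    """Bind each abstract slot to a live element id.

    Returns None when any slot cannot be bound, so the caller can fall
    back to ordinary low-level actions instead of running a broken skill.
    """
    by_role: Dict[str, List[Element]] = {}
    for element in live:
        by_role.setdefault(element.role, []).append(element)

    binding: Dict[str, str] = {}
    for name, ref in slots.items():
        candidates = by_role.get(ref.role, [])
        if ref.index >= len(candidates):
            return None
        binding[name] = candidates[ref.index].element_id
    return binding
```

A login skill induced on one site would then carry slots such as `{"username": SlotRef("textbox", 0), "password": SlotRef("textbox", 1), "submit": SlotRef("button", 0)}`, and `ground` would map them to whatever element ids the live page uses. Returning `None` on a failed binding keeps the agent from executing a skill against the wrong elements.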

Who benefits

Software Development, E-commerce, Customer Service, Data Analytics, QA Testing

Key takeaways

  • SkillMigrator improves web agent efficiency by reusing interaction patterns across sites.
  • It matches layout structure instead of specific element references for skill transfer.
  • The approach reduces LLM action counts and associated latency and cost.
  • This enables more robust and adaptable web automation for complex tasks.
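When comparing your own agents on action counts, note that the reported metric averages LLM actions over successful trajectories only. The sketch below shows one way to compute that metric and the relative reduction between two agents. The `Trajectory` record and function names are assumptions made for this example, and the numbers in the usage note are toy values, not results from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Trajectory:
    success: bool     # did the task finish correctly?
    llm_actions: int  # number of LLM-emitted actions in the run


def mean_actions_on_success(runs: List[Trajectory]) -> float:
    """Average action count over successful runs only."""
    counts = [r.llm_actions for r in runs if r.success]
    if not counts:
        raise ValueError("no successful trajectories to average over")
    return sum(counts) / len(counts)


def relative_reduction(baseline: List[Trajectory], candidate: List[Trajectory]) -> float:
    """Fraction by which the candidate lowers mean actions on successes."""
    base = mean_actions_on_success(baseline)
    cand = mean_actions_on_success(candidate)
    return (base - cand) / base
```

With toy data, a baseline averaging 12 actions on its successful runs and a candidate averaging 9 gives a relative reduction of 0.25. Failed runs are excluded from both averages, so success rates should be reported alongside this number.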

Original post by Shiqi He, Yue Cui, Feijie Wu, Xinyu Ma, Jiaheng Lu, Yaliang Li, Bolin Ding, Mosharaf Chowdhury

"arXiv:2606.17645v1 Announce Type: new Abstract: Large language model (LLM) web agents are usually deployed as tool callers: each turn, the model reads a fresh page observation and emits one structured tool action. When every action is a low-level primitive, horizons grow quickly…"
