Voyager: An Open-Ended Embodied Agent with Large Language Models

1NVIDIA, 2Caltech, 3UT Austin, 4Stanford, 5ASU
*Equal contribution †Equal advising
Corresponding authors: guanzhi@caltech.edu, dr.jimfan.ai@gmail.com

Abstract

We introduce Voyager, the first LLM-powered embodied lifelong learning agent in Minecraft that continuously explores the world, acquires diverse skills, and makes novel discoveries without human intervention. Voyager consists of three key components: 1) an automatic curriculum that maximizes exploration, 2) an ever-growing skill library of executable code for storing and retrieving complex behaviors, and 3) a new iterative prompting mechanism that incorporates environment feedback, execution errors, and self-verification for program improvement. Voyager interacts with GPT-4 via blackbox queries, which bypasses the need for model parameter fine-tuning. The skills developed by Voyager are temporally extended, interpretable, and compositional, which compounds the agent's abilities rapidly and alleviates catastrophic forgetting. Empirically, Voyager shows strong in-context lifelong learning capability and exhibits exceptional proficiency in playing Minecraft. It obtains 3.3x more unique items, travels 2.3x longer distances, and unlocks key tech tree milestones up to 15.3x faster than prior SOTA. Voyager is able to utilize the learned skill library in a new Minecraft world to solve novel tasks from scratch, while other techniques struggle to generalize.


Voyager discovers new Minecraft items and skills continually by self-driven exploration, significantly outperforming the baselines.

Introduction

Building generally capable embodied agents that continuously explore, plan, and develop new skills in open-ended worlds is a grand challenge for the AI community. Classical approaches employ reinforcement learning (RL) and imitation learning that operate on primitive actions, which could be challenging for systematic exploration, interpretability, and generalization. Recent advances in large language model (LLM) based agents harness the world knowledge encapsulated in pre-trained LLMs to generate consistent action plans or executable policies. They are applied to embodied tasks like games and robotics, as well as NLP tasks without embodiment. However, these agents are not lifelong learners that can progressively acquire, update, accumulate, and transfer knowledge over extended time spans.

Let us consider Minecraft as an example. Unlike most other games studied in AI, Minecraft does not impose a predefined end goal or a fixed storyline but rather provides a unique playground with endless possibilities. An effective lifelong learning agent should have capabilities similar to those of human players: (1) propose suitable tasks based on its current skill level and world state, e.g., learn to harvest sand and cactus before iron if it finds itself in a desert rather than a forest; (2) refine skills based on environment feedback and commit mastered skills to memory for future reuse in similar situations, e.g., fighting zombies is similar to fighting spiders; (3) continually explore the world and seek out new tasks in a self-driven manner.

Voyager Components

We introduce Voyager, the first LLM-powered embodied lifelong learning agent to drive exploration, master a wide range of skills, and make new discoveries continually without human intervention in Minecraft. Voyager is made possible through three key modules: 1) an automatic curriculum that maximizes exploration; 2) a skill library for storing and retrieving complex behaviors; and 3) a new iterative prompting mechanism that generates executable code for embodied control. We opt to use code as the action space instead of low-level motor commands because programs can naturally represent temporally extended and compositional actions, which are essential for many long-horizon tasks in Minecraft. Voyager interacts with a blackbox LLM (GPT-4) through prompting and in-context learning. Our approach bypasses the need for model parameter access and explicit gradient-based training or finetuning.



Voyager consists of three key components: an automatic curriculum for open-ended exploration, a skill library for increasingly complex behaviors, and an iterative prompting mechanism that uses code as action space.

Automatic Curriculum


Automatic curriculum. The automatic curriculum takes into account the exploration progress and the agent's state to maximize exploration. The curriculum is generated by GPT-4 based on the overarching goal of "discovering as many diverse things as possible". This approach can be perceived as an in-context form of novelty search.
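As a rough illustration of how such a curriculum query might be assembled, the sketch below builds a prompt from the agent's state and its exploration progress and passes it to a language model. It is a minimal sketch under stated assumptions, not Voyager's actual prompt: the names `Progress`, `build_curriculum_prompt`, and `propose_next_task` are hypothetical, representing progress as lists of completed and failed tasks is an assumption, and `llm` stands for any callable that maps a prompt string to a reply.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Overarching goal quoted from the text above.
GOAL = "discovering as many diverse things as possible"


@dataclass
class Progress:
    """Hypothetical record of exploration progress (an assumption)."""
    completed: List[str] = field(default_factory=list)
    failed: List[str] = field(default_factory=list)


def build_curriculum_prompt(agent_state: dict, progress: Progress) -> str:
    """Combine the goal, the agent's state, and its progress into one prompt."""
    lines = [f"Overarching goal: {GOAL}.", "Agent state:"]
    for key in sorted(agent_state):
        lines.append(f"  {key}: {agent_state[key]}")
    lines.append("Completed tasks: " + (", ".join(progress.completed) or "none"))
    lines.append("Failed tasks: " + (", ".join(progress.failed) or "none"))
    lines.append("Propose the next task.")
    return "\n".join(lines)


def propose_next_task(llm: Callable[[str], str],
                      agent_state: dict,
                      progress: Progress) -> str:
    """Ask the (blackbox) language model for the next task."""
    return llm(build_curriculum_prompt(agent_state, progress)).strip()
```

In use, `llm` would wrap a GPT-4 query; a stub such as `lambda prompt: "Harvest cactus"` is enough to exercise the flow offline.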


Skill Library


Skill library. Top: Adding a new skill. Each skill is indexed by the embedding of its description, which can be retrieved in similar situations in the future. Bottom: Skill retrieval. When faced with a new task proposed by the automatic curriculum, we perform querying to identify the top-5 relevant skills. Complex skills can be synthesized by composing simpler programs, which compounds Voyager's capabilities rapidly over time and alleviates catastrophic forgetting.
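The indexing and top-5 retrieval described above can be sketched as follows. This is an illustrative toy under stated assumptions, not the authors' implementation: `SkillLibrary`, `toy_embed`, and `cosine` are hypothetical names, and the bag-of-words "embedding" is a stand-in for a learned text embedding, chosen so that the example runs without external services.

```python
import math
import re
from collections import Counter
from typing import Dict, List, Tuple


def toy_embed(text: str) -> Counter:
    """Stand-in embedding: bag-of-words counts (assumption, not a real model)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


class SkillLibrary:
    """Stores skill programs keyed by the embedding of their description."""

    def __init__(self) -> None:
        self._skills: Dict[str, Tuple[Counter, str]] = {}

    def add(self, name: str, description: str, code: str) -> None:
        # Index the skill by the embedding of its description.
        self._skills[name] = (toy_embed(description), code)

    def retrieve(self, task: str, k: int = 5) -> List[str]:
        # Rank stored skills by similarity to the new task; keep the top k.
        query = toy_embed(task)
        ranked = sorted(self._skills,
                        key=lambda n: cosine(query, self._skills[n][0]),
                        reverse=True)
        return ranked[:k]

    def code_for(self, name: str) -> str:
        return self._skills[name][1]
```

With skills described as "Fight a zombie with a sword" and "Craft a wooden pickaxe from planks and sticks" stored, a query such as "Fight a spider with a sword" ranks the zombie-fighting skill first, mirroring the reuse across similar situations mentioned in the introduction.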


Iterative Prompting Mechanism


Left: Environment feedback. GPT-4 realizes it needs 2 more planks before crafting sticks. Right: Execution error. GPT-4 realizes it should craft a wooden axe instead of an acacia axe since there is no acacia axe in Minecraft.



Self-verification. By providing the agent's current state and the task to GPT-4, we ask it to act as a critic and inform us whether the program achieves the task. In addition, if the task fails, it provides a critique by suggesting how to complete the task.
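Taken together, the three feedback signals suggest a refinement loop along the following lines. This is a minimal sketch under stated assumptions rather than Voyager's code: `ExecutionResult`, `iterative_prompting`, and the three callables are hypothetical, `generate_code` and `verify` stand in for GPT-4 queries, `execute` stands in for running the program in the game, and the `max_rounds` default is an arbitrary illustrative budget.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class ExecutionResult:
    feedback: str            # environment feedback, e.g. in-game messages
    error: Optional[str]     # execution error text, or None if the program ran
    state: dict              # agent state after execution


def iterative_prompting(
    task: str,
    generate_code: Callable[[str, str, Optional[str], str], str],
    execute: Callable[[str], ExecutionResult],
    verify: Callable[[str, dict], Tuple[bool, str]],
    max_rounds: int = 4,     # assumption: illustrative retry budget
) -> Optional[str]:
    """Refine a program until the critic accepts it or the budget runs out."""
    feedback, error, critique = "", None, ""
    for _ in range(max_rounds):
        # Each round sees the feedback, error, and critique of the previous one.
        code = generate_code(task, feedback, error, critique)
        result = execute(code)
        feedback, error = result.feedback, result.error
        if error is None:
            # Self-verification: a critic judges success from the agent state.
            success, critique = verify(task, result.state)
            if success:
                return code
        else:
            critique = ""    # nothing to critique if the program did not run
    return None
```

A program returned by this loop is one the critic judged successful, which makes it a natural candidate for storage in the skill library; a `None` result signals that the task should be set aside and possibly retried later.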

Experiments

We systematically evaluate Voyager and baselines on their exploration performance, tech tree mastery, map coverage, and zero-shot generalization capability to novel tasks in a new world.



Significantly Better Exploration

As shown in the first figure, Voyager consistently makes new strides, discovering 63 unique items within 160 prompting iterations, 3.3x as many novel items as its counterparts. In contrast, AutoGPT lags considerably in discovering new items, while ReAct and Reflexion struggle to make significant progress.

Tech Tree Mastery

Tech tree mastery. The Minecraft tech tree tests the agent's ability to craft and use a hierarchy of tools. Progressing through this tree (wooden tool → stone tool → iron tool → diamond tool) requires the agent to master systematic and compositional skills. In this table, fractions indicate the number of successful trials out of three total runs. Numbers are prompting iterations averaged over three trials. The fewer the iterations, the more efficient the method. Compared with baselines, Voyager unlocks the wooden level 15.3x faster (in terms of prompting iterations), the stone level 8.5x faster, and the iron level 6.4x faster. Voyager is also the only method to unlock the diamond level of the tech tree.


Extensive Map Traversal


Map coverage: Two bird's eye views of Minecraft maps. Voyager is able to navigate distances 2.3x longer compared to baselines by traversing a variety of terrains, while the baseline agents often find themselves confined to local areas, which significantly hampers their capacity to discover new knowledge.


Efficient Zero-Shot Generalization to Unseen Tasks


Zero-shot generalization to unseen tasks. We clear the agent's inventory, reset it to a newly instantiated world, and test it with unseen tasks. In the table above, fractions indicate the number of successful trials out of three total runs. Numbers are prompting iterations averaged over three trials. The fewer the iterations, the more efficient the method. Voyager can consistently solve all the tasks, while baselines cannot solve any task within 50 prompting iterations. What's interesting to note is that our skill library constructed from lifelong learning not only enhances Voyager's performance but also gives a boost to AutoGPT. This demonstrates that the skill library serves as a versatile tool that can be readily employed by other methods, effectively acting as a plug-and-play asset to enhance performance.


Ablation Studies


Ablation studies. GPT-3.5 means replacing GPT-4 with GPT-3.5 for code generation. Voyager outperforms all the alternatives, demonstrating the critical role of each component. In addition, GPT-4 significantly outperforms GPT-3.5 in code generation.

Conclusion

In this work, we introduce Voyager, the first LLM-powered embodied lifelong learning agent, which leverages GPT-4 to explore the world continuously, develop increasingly sophisticated skills, and make new discoveries consistently without human intervention. Voyager exhibits superior performance in discovering novel items, unlocking the Minecraft tech tree, traversing diverse terrains, and applying its learned skill library to unseen tasks in a newly instantiated world. Voyager serves as a starting point to develop powerful generalist agents without tuning the model parameters.

Media Coverage

"They Plugged GPT-4 Into Minecraft—and Unearthed New Potential for AI. The bot plays the video game by tapping the text generator to pick up new skills, suggesting that the tech behind ChatGPT could automate many workplace tasks." - Will Knight, WIRED

"The Voyager project shows, however, that by pairing GPT-4’s abilities with agent software that stores sequences that work and remembers what does not, developers can achieve stunning results." - John Koetsier, Forbes

"Voyager, the GTP-4 bot that plays Minecraft autonomously and better than anyone else" - Ruetir

"This AI used GPT-4 to become an expert Minecraft player" - Devin Coldewey, TechCrunch

Coverage Index: [Atmarkit] [Career Engine] [Crast.net] [Daily Top Feeds] [Entrepreneur en Espanol] [Finance Jxyuging] [Forbes] [Forbes Argentina] [Gaming Deputy] [Gearrice] [Haberik] [Head Topics] [InfoQ] [ITmedia News] [Mark Tech Post] [Medium] [MSN] [Note] [Noticias de Hoy] [Ruetir] [Stock HK] [Tech Tribune France] [TechCrunch] [TechBeezer] [Toutiao] [US Times Post] [VN Explorer] [WIRED] [Zaker]

Team

Guanzhi Wang
Yuqi Xie
Yunfan Jiang*
Ajay Mandlekar*

Chaowei Xiao
Yuke Zhu
Linxi "Jim" Fan
Anima Anandkumar

* Equal Contribution   † Equal Advising

BibTeX

@article{wang2023voyager,
  title   = {Voyager: An Open-Ended Embodied Agent with Large Language Models},
  author  = {Guanzhi Wang and Yuqi Xie and Yunfan Jiang and Ajay Mandlekar and Chaowei Xiao and Yuke Zhu and Linxi Fan and Anima Anandkumar},
  year    = {2023},
  journal = {arXiv preprint arXiv:2305.16291}
}