נושא הפרוייקט

מספר פרוייקט מחלקה שמות סטודנטים אימייל שמות מנחים

CompareRL

CompareRL

תקציר בעיברית

הפרויקט שלנו נועד להעריך ולהשוות סוכני AI שונים שהוכשרו על נתונים מתחרויות MineRL כדי לזהות את הגישה היעילה ביותר למשחק ה- Minecraft. תחרויות אלו מתמקדות בפיתוח סוכנים sample-efficient למשחק ה- Minecraft, וניתן למצוא בספרות מגוון אלגוריתמים וגישות שונות, כולל AI היררכי, Imitation Learning, ו-Behavioral Cloning. עם זאת, קיים חוסר במחקר השוואתי בתחום זה. על מנת למלא את הפער הזה, ניישם שני אלגוריתמים מובילים מהתחרויות, האחד מבוסס על Imitation Learning, והשני מבוסס על Behavioral Cloning. בנוסף לאלגוריתמים הקיימים, נציג אלגוריתם חדש וייחודי אשר בוחן כיוון אחר ממה שנעשה בתחרויות, תוך התמקדות ספציפית ב-Planning במקום להסתמך על למידת חיזוק (RL). האלגוריתם ינתח את תתי המטרות של הסוכן לאורך המשחק וילמד את הפעולות הרלוונטיות לכל תת מטרה. נרצה להבין ולבחון ביצועים של אלגוריתם שפועל בגישת Planning לעומת אלגוריתמים אחרים מבוססי RL. כל האלגוריתמים יתמקדו במשימה אחת שתחרויות MineRL מציעות שהיא השגת Iron Pickaxe (ObtainIronPickaxe). על ידי יישום והערכה של מגוון האלגוריתמים בסביבת המחקר הייחודית שלנו, נוכל להשוות את הביצועים שלהם זה מול זה. בנוסף, נעריך את הביצועים שלהם מול האלגוריתם הייחודי החדש שלנו. על ידי ביצוע הערכה מקיפה זו, המחקר נועד לתרום לגוף הידע הקיים ולספק תובנות לגבי היעילות של גישות AI שונות בתחום Minecraft.

תקציר באנגלית

Our project aims to evaluate and compare different AI agents trained on data from the MineRL competitions to identify the most effective approach for playing Minecraft. These competitions focus on developing sample-efficient agents for playing Minecraft, and various algorithms and approaches have been utilized, including Hierarchical AI, Imitation Learning, and Behavioral Cloning. However, there is a lack of comparative research in this area. To overcome this gap, we will implement two leading algorithms from the competitions, one based on Imitation Learning, and the other based on Behavioral Cloning. In addition to these state-of-the-art algorithms, we will introduce a new and unique algorithm that takes a different direction from what was done in the competitions, specifically focusing on Planning instead of relying on Reinforcement Learning (RL). It will analyze the sub-goals of the agent throughout the game and learn the actions relevant to each sub-goal. We are interested in understanding how an algorithm that exclusively operates on Planning principles can perform in the Minecraft environment. All the algorithms will focus on one task that the MineRL competitions offer, which is obtaining the Iron Pickaxe. By implementing and evaluating the variety of algorithms in our unique research environment, we will be able to compare their performance against each other. Additionally, we will assess their performance against our new unique algorithm. This research aims to contribute to the existing body of knowledge and provide insights into the effectiveness of different AI approaches in the Minecraft domain.