Learning to Win by Reading Manuals in a Monte-Carlo Framework

作者:
S.R.K. Branavan branavan@csail.mit.edu
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology
David Silver d.silver@cs.ucl.ac.uk
Department of Computer Science
University College London
Regina Barzilay regina@csail.mit.edu
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology

摘要
Domain knowledge is crucial for effective performance in autonomous control systems. Typically, human effort is required to encode this knowledge into a control algorithm. In this paper, we present an approach to language grounding which automatically interprets text in the context of a complex control application, such as a game, and uses domain knowledge extracted from the text to improve control performance. Both text analysis and control strategies are learned jointly using only a feedback signal inherent to the application.

To effectively leverage textual information, our method automatically extracts the text segment most relevant to the current game state, and labels it with a task-centric predicate structure. This labeled text is then used to bias an action selection policy for the game, guiding it towards promising regions of the action space. We encode our model for text analysis and game playing in a multi-layer neural network, representing linguistic decisions via latent variables in the hidden layers, and game action quality via the output layer.

Operating within the Monte-Carlo Search framework, we estimate model parameters using feedback from simulated games. We apply our approach to the complex strategy game Civilization II using the official game manual as the text guide. Our results show that a linguistically-informed game-playing agent significantly outperforms its language-unaware counterpart, yielding a 34% absolute improvement and winning over 65% of games when playing against the built-in AI of Civilization.

最后編輯于
?著作權歸作者所有,轉載或內容合作請聯(lián)系作者
【社區(qū)內容提示】社區(qū)部分內容疑似由AI輔助生成,瀏覽時請結合常識與多方信息審慎甄別。
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發(fā)布,文章內容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務。

相關閱讀更多精彩內容

  • **2014真題Directions:Read the following text. Choose the be...
    又是夜半驚坐起閱讀 11,081評論 0 23
  • 今天沒有做新的東西,而是把昨天分析的google keep的界面都畫了一遍然后再仔細分析。剛好加了一個在做交互的師...
    7win7閱讀 910評論 1 5
  • 在兒時的記憶里,家鄉(xiāng)有一供路人小憩的木亭子,沒名,姑且叫“街心的亭”吧,因為這亭也恰好在街道中間。家鄉(xiāng)與兒時記憶...
    小小佘閱讀 355評論 0 1
  • 白鳳,你過來?!毙l(wèi)莊號了號赤練的脈,神情嚴肅地對白鳳說。 白鳳一呆,低聲安撫了赤練,在她眉心落下一個吻后和衛(wèi)莊出了...
    秋商閱讀 1,275評論 4 11
  • 當你看到最后的時候,你會發(fā)現(xiàn),在那樣一個大背景下,對任何一個人的好壞作出評判都是很難的。
    愛閱讀的我閱讀 317評論 0 1

友情鏈接更多精彩內容