Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

Graph-based Deep Q Network for Web Navigation

NotificationsYou must be signed in to change notification settings

Sheng-J/DOM-Q-NET

Repository files navigation

"DOM-Q-NET: Grounded RL on Structured Language"International Conference on Learning Representations (2019). Sheng Jia, Jamie Kiros, Jimmy Ba.[arxiv][openreview]
Architecture

Demo

Trained multitask agent:https://www.youtube.com/watch?v=eGzTDIvX4IY
Facebook login:https://www.youtube.com/watch?v=IQytRUKmWhs&t=2s

Requirement

Need to download selenium & install chrome driver for selenium..

Installation

  1. Clone this repo
  2. Download MiniWoB++ environment from the original repohttps://github.com/stanfordnlp/miniwob-plusplus
    and copy miniwob-plusplus/html folder to miniwob/html in this repo
  3. In fact, this html folder could be stored anywhere, but remember to perform one of the following actions:
  • Set environment variable"WOB_PATH" to
    file://"your-path-to-miniwob-plusplus"/html/miniwob
    E.g. "your-path-to-miniwob-plusplus" is "/h/sheng/DOM-Q-NET/miniwob
  • Directly modify thebase_url on line 33 of instance.py to
    "your-path-to-miniwob-plusplus"/html/miniwob
    In my case,base_url='file:///h/sheng/DOM-Q-NET/miniwob/html/miniwob/'

Run experiment

Experiment launch files are stored underrunsFor example,

cd runs/hard2medium9tasks/sh run1.sh

will launch a 11 multi-task (social-mediasearch-enginelogin-userenter-passwordclick-checkboxesclick-optionenter-dynamic-textenter-textemail-inbox-deleteclick-tab-2navigation-tree) experiment.

Multitask Assumptions

State & Action restrictions

ItemMaximum number of items
DOM tree leaves (action space)160
DOM tree200
Instruction tokens16

Attribute embeddings & vocabulary

Attributemax vocabularyEmbedding dimension
Tag10016
Text (shared with instructions)60048
Class10016
  • UNKnown tokens
    These are assigned to a random vector such that the cosine distance with the text attribute can yield 1.0 for the direct alignment.

Acknowledgement

Credit to Dopamine for the implementation of prioritized replay used in dstructs/dopamine_segtree.py

Releases

No releases published

Packages

No packages published

[8]ページ先頭

©2009-2025 Movatter.jp