Releases: browser-use/web-ui

💥 Browser-Use + MCP: Unleashing Agent Power Beyond the Browser! 🚀

01 May 05:56
664f874
This commit was created on GitHub.com and signed with GitHub’s verified signature.
GPG key ID: B5690EEEBB952194

Hey everyone,

Get ready for a game-changing update! We're absolutely thrilled to announce the latest version of browser-use webui, packed with features that dramatically expand the capabilities of your AI agents. This release marks a significant milestone, especially with the deep integration of browser-use and MCP!

Here's what's new and exciting:

  1. Full Compatibility with browser-use 0.1.48: We've updated our integration to be fully compatible with browser-use version 0.1.48. This means you can leverage all the latest features, improvements, and stability enhancements from the core browser-use library right out of the box. Stay on the cutting edge of browser automation! ✨

  2. Browser-use Meets MCP Servers: A New Era of Agent Power! 🥁 With the introduction of MCP server support, we've unlocked a universe of possibilities for your agents!

    • What does this mean? browser-use can now seamlessly interact with external tools and services defined as MCP servers. Think of it as giving your browser agent access to a whole new set of limbs and senses outside of the web page!
    • Go Beyond Browsing: Your browser-use agent can now run desktop commands (like file operations or launching applications via tools like @wonderwhy-er/desktop-commander), interact with local services, connect to databases, run scripts, and so much more. The potential is truly vast!
    • Simple Configuration: Getting started with MCP servers is straightforward. Simply define your desired servers in your configuration file using a JSON structure like the following (the same format as the Claude Desktop MCP JSON):
    {
      "mcpServers": {
        "desktop-commander": {
          "command": "npx",
          "args": ["-y", "@wonderwhy-er/desktop-commander"]
        }
      }
    }
    Demo video: 0501-mcp-test.mov
  3. Brand New Web UI for Agent Interaction:
    • Introducing our shiny new Web UI! 🌐 Interact with your browser-use agent in a conversational manner directly from your browser.
    • The UI lets you provide human intervention when needed, whether that means solving a CAPTCHA, making a complex decision, or guiding the agent through an unexpected situation. It's seamless human-agent collaboration!
  4. Enhanced DeepResearch Agent (MCP Enabled!): We've also significantly upgraded our deepresearch agent! 🧠 This new version is more powerful and efficient at gathering and synthesizing information. And yes, it also fully supports the new MCP system, enabling it to use external tools for research tasks.
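
As a rough illustration of the MCP server configuration described in this release, the sketch below parses a config with the same "mcpServers" structure and prints the command line each server entry would launch. The helper name list_mcp_commands is hypothetical and exists only for this example; it is not part of browser-use or the WebUI, and how the WebUI actually reads the configuration may differ.

```python
import json

# Same structure as the example configuration in this release note.
MCP_CONFIG_TEXT = """
{
  "mcpServers": {
    "desktop-commander": {
      "command": "npx",
      "args": ["-y", "@wonderwhy-er/desktop-commander"]
    }
  }
}
"""


def list_mcp_commands(config_text: str) -> dict[str, list[str]]:
    """Return {server_name: argv} for each entry under "mcpServers".

    Hypothetical helper for illustration only.
    """
    config = json.loads(config_text)
    servers = config.get("mcpServers")
    if not isinstance(servers, dict):
        raise ValueError('expected a top-level "mcpServers" object')

    commands = {}
    for name, spec in servers.items():
        if "command" not in spec:
            raise ValueError(f'server "{name}" has no "command"')
        # "args" is treated as optional here; that is an assumption.
        commands[name] = [spec["command"], *spec.get("args", [])]
    return commands


commands = list_mcp_commands(MCP_CONFIG_TEXT)
for name, argv in commands.items():
    print(f"{name}: {' '.join(argv)}")
```

Checking the file this way before launching the WebUI can catch a misplaced brace or a missing "command" key early.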

How to Update:

    uv pip install -r requirements.txt
    python webui.py

Thanks to all the contributors!

Thank you for your continued support! Happy building! 🎉


Security Update & UI Enhancements!

29 Mar 03:37
f4f36b4

Hello everyone,

We're happy to announce the new release of browser-use-webui! 🎉 This update brings support for new models, several improvements, and an important security fix.

  • Hotfix for several issues, such as opening multiple tabs.

Here's what's new:

  • WebUI Compatibility: Updated the WebUI to be compatible with the latest browser-use==0.1.40. Remember to run uv pip install -r requirements.txt.
  • 🐛 Bug Fixes: Squashed several bugs to improve stability and performance.
  • 🎨 UI Optimization: Refreshed the WebUI for a cleaner, more intuitive, and aesthetically pleasing user experience.
  • 🤖 New Model Support:
    • Gemini: Added support for gemini-2.5-pro-exp-03-25. Simply input the model name directly.
    • DeepSeek: Added support for DeepSeek-V3-0324. Select the deepseek-chat option and remember to uncheck use_vision for this model.
  • ⚙️ Improved Config Handling: Reworked the WebUI config saving and loading mechanism. It's now more robust and adaptive to user configurations.

🚨 Important Security Update: 🚨

  • We have fixed a critical security vulnerability related to loading WebUI configurations using pickle. Loading untrusted pickle files can potentially lead to arbitrary code execution.
  • To mitigate this risk, we have migrated to using json for saving and loading WebUI settings. This is a much safer standard.
  • We strongly urge all users to update to v1.7 or later immediately to protect themselves. Please avoid using older versions that load configurations via pickle. Your settings should automatically migrate where possible, but backing up your old config is always wise.
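
To illustrate why json is the safer choice, here is a minimal sketch of saving and loading settings as JSON. Unlike unpickling, json.loads only builds plain data (dicts, lists, strings, numbers, booleans, null) and never runs code from the file. The function names, file name, and setting keys below are hypothetical and are not taken from the WebUI's source.

```python
import json
import tempfile
from pathlib import Path


def save_settings(path: Path, settings: dict) -> None:
    """Write settings as human-readable JSON."""
    path.write_text(json.dumps(settings, indent=2), encoding="utf-8")


def load_settings(path: Path) -> dict:
    """Load settings; parsing JSON cannot execute code from the file."""
    data = json.loads(path.read_text(encoding="utf-8"))
    if not isinstance(data, dict):
        raise ValueError("settings file must contain a JSON object")
    return data


# Round-trip through a temporary file. The keys are made up for the example.
with tempfile.TemporaryDirectory() as tmp:
    config_path = Path(tmp) / "webui_settings.json"
    save_settings(config_path, {"llm_provider": "deepseek", "use_vision": False})
    restored = load_settings(config_path)

print(restored)
```

A JSON settings file can also be inspected in any text editor, which makes it easier to review a config received from someone else.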

Thanks to @Wh1teZe; refer to #451.

Contributors

  • @Wh1teZe

DeepResearch Lands on Browser-Use Web UI, with Collaborative Agents! 🤖🤝📚

06 Feb 12:16
7de7d90

Thanks to @vvincent1234. Now, you can seamlessly leverage DeepResearch's advanced capabilities in WebUI.

Important Notes:

  • The DeepResearch feature is currently in alpha and under rapid development. Stay updated by watching this repository.
  • DeepResearch consumes a relatively large number of tokens. Reduce Max Search Iteration and Max Query per Iteration according to your needs. These settings are, respectively, the maximum number of search iterations and the number of simultaneous queries per search iteration.
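
To get a feel for how these two settings interact, the sketch below computes an upper bound on the number of search queries a run can issue. It assumes each iteration issues at most the per-iteration limit. The function name is hypothetical, and actual token usage also depends on how much content each page returns.

```python
def max_total_queries(max_search_iterations: int, max_queries_per_iteration: int) -> int:
    """Upper bound on queries for one research run (illustrative only)."""
    if max_search_iterations < 1 or max_queries_per_iteration < 1:
        raise ValueError("both limits must be at least 1")
    return max_search_iterations * max_queries_per_iteration


# Lowering either setting shrinks the bound proportionally.
print(max_total_queries(10, 3))  # 30
print(max_total_queries(3, 1))   # 3
```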

What's New?

2025/02/09

  1. Hotfixed some bugs.
  2. Split extracted content and limit the maximum content length.
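
The release note does not say how content is split or capped. As a minimal sketch of the general idea, the hypothetical helper below truncates text to a character budget and then cuts it into fixed-size chunks. The real implementation may split on other boundaries or count tokens instead of characters.

```python
def split_and_limit(text: str, chunk_size: int, max_total_chars: int) -> list[str]:
    """Cap text at max_total_chars, then split it into chunk_size pieces.

    Hypothetical helper; character-based for simplicity.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    if max_total_chars < 0:
        raise ValueError("max_total_chars must not be negative")
    limited = text[:max_total_chars]
    return [limited[i:i + chunk_size] for i in range(0, len(limited), chunk_size)]


chunks = split_and_limit("a" * 25, chunk_size=10, max_total_chars=22)
print([len(chunk) for chunk in chunks])  # [10, 10, 2]
```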

2025/02/07

  1. Added a stop button, allowing you to stop your research at any time.
  2. Use your own browser. Note that this mode currently supports only a single search per iteration.
  3. We currently recommend the gemini-2.0-flash-thinking-exp-01-21 model, because excessively long extracted content can sometimes cause API call errors.

Key benefits of this integration include:

  • DeepResearch within Your Browser: Access all DeepResearch features directly in your own browser – no more need for external search APIs! 🌐
  • Collaborative Agents: Harness the power of multiple AI agents working in concert. 🤖🤝
  • Indexed Information Sources: Easily save and access all referenced articles for future reference, promoting transparency and ensuring the reliability of your research. 📚

How to Get Started:

  1. Update Your Code: Pull the latest version to experience the new features. ⬆️
  2. Choose a Powerful LLM: To fully utilize DeepResearch, select a reasoning-capable LLM such as gemini-2.0-flash-thinking-exp-01-21, deepseek-r1, or o3-mini. 🧠
  3. Enter Your Research Topic: Navigate to the DeepResearch section within the Browser-Use Web UI and input your research theme. 📝
  4. Configure Parameters: Adjust the max_search_iteration_input and max_query_per_iter_input according to the complexity of your research. ⚙️
  5. Run Deep Research: Click the "run_deep_research" button and wait for your professional research report to be generated. ⏳

Demo:
https://www.youtube.com/watch?v=sguzGWuiRT8

Contributors

  • @vvincent1234

🚀 Local DeepSeek-r1 Power with Ollama!

28 Jan 12:52
0c9cb9b

Hey everyone,

We've just rolled out a new release packed with awesome updates:

  1. Browser-Use Upgrade: We're now fully compatible with the latest browser-use version 0.1.29! 🎉
  2. Local Ollama Integration: Get ready for completely local and private AI with support for the incredible deepseek-r1 model via Ollama! 🏠

Before You Dive In:

  • Update Code: Don't forget to git pull to grab the latest code changes.
  • Reinstall Dependencies: Run pip install -r requirements.txt to ensure all your dependencies are up to date.

Important Notes on deepseek-r1:

  • Model Size Matters: We've found that deepseek-r1:14b and larger models work exceptionally well! Smaller models may not provide the best experience, so we recommend sticking with the larger options. 🤔

How to Get Started with Ollama and deepseek-r1:

  1. Install Ollama: Head over to ollama and download/install Ollama on your system. 💻
  2. Run deepseek-r1: Open your terminal and run the command: ollama run deepseek-r1:14b (or a larger model if you prefer).
  3. WebUI Setup: Launch the WebUI following the instructions. Here's a crucial step: uncheck "Use Vision" and set "Max Actions per Step" to 1. ✅
  4. Enjoy! You're now all set to experience the power of local deepseek-r1. Have fun! 🥳
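
Before launching the WebUI, it can help to confirm that the model from step 2 is actually present locally. The sketch below assumes Ollama is listening on its default local port (11434) and that its model-listing endpoint returns a JSON object with a "models" list whose entries carry a "name" field. The helper names are hypothetical, and the sample response is made up for the example.

```python
import json
import urllib.request

# Default local Ollama endpoint for listing pulled models (assumed defaults).
OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"


def model_is_available(tags_response: dict, wanted: str) -> bool:
    """Return True if `wanted` appears among the listed model names."""
    names = [model.get("name", "") for model in tags_response.get("models", [])]
    return wanted in names


def fetch_tags(url: str = OLLAMA_TAGS_URL) -> dict:
    """Query a running Ollama instance; raises URLError if none is reachable."""
    with urllib.request.urlopen(url, timeout=5) as response:
        return json.load(response)


# Offline demonstration with a made-up response of the assumed shape.
sample = {"models": [{"name": "deepseek-r1:14b"}, {"name": "llama3:8b"}]}
print(model_is_available(sample, "deepseek-r1:14b"))  # True
print(model_is_available(sample, "deepseek-r1:32b"))  # False
```

With Ollama running, model_is_available(fetch_tags(), "deepseek-r1:14b") performs the same check against the live instance.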

Happy Chinese New Year! 🏮


✨ DeepSeek-r1 + Browser-use = New Magic ✨

25 Jan 16:13
5bc4978

🚀 Exciting news! Your browser-use can now engage in deep thinking!

Notes:

  1. The current version is a preview of DeepSeek-r1 support and is still under development; please keep your code up to date.
  2. The current version supports only the official DeepSeek-r1 API.

How to Use:

  1. 🔑 Configure API Key: Make sure you have set the correct DEEPSEEK_API_KEY in your .env file.

  2. 🌐 Launch WebUI: Launch the WebUI as instructed in the README.

  3. 👀 Disable Vision: In Agent Settings, uncheck "Use_Vision".

  4. 🤖 Select Model: In LLM Provider, select "deepseek", and in Model Name, select "deepseek-reasoner".

  5. 🎉 Enjoy!
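
For step 1, a quick way to confirm the key is present is to scan the .env text for a non-empty DEEPSEEK_API_KEY entry. The sketch below uses a deliberately simple, hypothetical parser (no export prefixes, multi-line values, or variable expansion), so treat it as a sanity check, not a replacement for whatever .env loader the project uses.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Strip optional surrounding quotes from the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def deepseek_key_is_set(env_text: str) -> bool:
    """True if DEEPSEEK_API_KEY exists and is not empty."""
    return bool(parse_env(env_text).get("DEEPSEEK_API_KEY"))


# The key value below is a placeholder, not a real credential.
print(deepseek_key_is_set("DEEPSEEK_API_KEY=sk-example\n"))            # True
print(deepseek_key_is_set("# DEEPSEEK_API_KEY=\nOPENAI_API_KEY=x\n"))  # False
print(deepseek_key_is_set("DEEPSEEK_API_KEY=\n"))                      # False
```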


Hotfix some errors

16 Jan 01:52
2654e6b
  1. Upgraded to browser-use==0.1.19 to fix the font OS error on Windows.
  2. Fixed a result-parsing error in the stream feature (Headless=True) and added support for returning the agent history file.
  3. Fixed the status of the Stop button in the stream feature.

Please update to the latest code and run pip install -r requirements.txt.


New WebUI: Enhanced Features and Compatibility

13 Jan 15:28
be89b90
  1. A brand-new WebUI interface with added features like video display.
  2. Adapted for the latest version of browser-use, with native support for models like Ollama, Gemini, and DeepSeek. Please update your code and run pip install -r requirements.txt.
  3. Ability to stop agent tasks at any time.
  4. Real-time page display in the WebUI when headless=True.
  5. Improved custom browser usage, fixing a bug with using your own browser on Mac.
  6. Support for Docker environment installation.

Original version

06 Jan 14:32
e481813
  1. A Brand New WebUI: We offer a comprehensive web interface that supports a wide range of browser-use functionalities. This UI is designed to be user-friendly and enables easy interaction with the browser agent.

  2. Expanded LLM Support: We've integrated support for various Large Language Models (LLMs), including Gemini, OpenAI, Azure OpenAI, Anthropic, DeepSeek, Ollama, and others. We plan to add support for even more models in the future.

  3. Custom Browser Support: You can use your own browser with our tool, eliminating the need to re-login to sites or deal with other authentication challenges. This feature also supports high-definition screen recording.

  4. Customized Agent: We've implemented a custom agent that enhances browser-use with optimized prompts.
