|
7 maanden geleden | |
---|---|---|
extension | 7 maanden geleden | |
src | 7 maanden geleden | |
.gitignore | 7 maanden geleden | |
LICENSE | 7 maanden geleden | |
README.md | 7 maanden geleden | |
config_example.yaml | 7 maanden geleden | |
install.sh | 7 maanden geleden | |
pyproject.toml | 7 maanden geleden |
Open source multi-agent browser automation tool with built-in Chrome extension. NanoBrowser makes it easy to perform complex browser tasks using natural language commands.
Chrome Extension Management page (chrome://extensions/
) with "Developer mode" enabled
Python Environment:
Python 3.10 or higher
chrome://extensions/
Select the nanobrowser/extension/src
directory
Install the NanoBrowser Agent Server: ```bash
cd nanobrowser
./install.sh
> Note: The installation script has only been tested on MacOS. For other platforms, please refer to [uv's official documentation](https://github.com/astral-sh/uv) to install uv and dependencies manually.
3. **Configure API Keys**:
Create a `.env` file in the nanobrowser directory and add your API key. The default LLM provider is OpenAI:
```bash
OPENAI_API_KEY=your_openai_key_here
If you want to use Claude (Anthropic), you'll need to modify the provider in config.yaml
and use:
ANTHROPIC_API_KEY=your_anthropic_key_here
bash
uv run nanobrowser
## Example Tasks
Here are some tasks you can accomplish with NanoBrowser using natural language:
1. News Extraction:
> "Go to TechCrunch and extract top 10 headlines from the last 24 hours"
2. GitHub Research:
> "Look for the trending Python repositories on GitHub with most stars"
3. Shopping Research:
> "Find a portable Bluetooth speaker on Amazon with a water-resistant design, under $50. It should have a minimum battery life of 10 hours"
4. Recipe Analysis:
> "Find a recipe with over 100 reviews for Fried Fish on Allrecipes, list the Full Nutrition Label and tell me the amount of Iron per Serving"
## Roadmap
Here are the key features we're working on:
1. Standalone Extension:
> "Simplify installation by making the Chrome extension work without a backend server"
2. Enhanced Tab Management:
> "Support multiple tabs with context awareness and state synchronization"
3. Conversation Intelligence:
> "Enable natural follow-up questions and preserve context between tasks"
4. Advanced DOM Analysis:
> "Improve interaction with complex web elements like canvas and dynamic content"
---
## Contributing
Contributions are welcome! Feel free to:
- Report bugs
- Suggest new features
- Submit pull requests
## Acknowledgments
Special thanks to these inspiring projects:
- python-cdp
- Agent-E
- fuji-web
## License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Made with ❤️ by the NanoBrowser Team