Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: browser-use/web-ui
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: 7eb62d4
Choose a base ref
...
head repository: drumnation/browser-use-cli
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: dedbe69
Choose a head ref
  • 11 commits
  • 47 files changed
  • 1 contributor

Commits on Jan 19, 2025

  1. Configuration menu
    Copy the full SHA
    4d10c51 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0829027 View commit details
    Browse the repository at this point in the history
  3. Enhance Google Gemini integration and environment configuration

    - Added support for Google Gemini API in `test_gemini_connection.py`, including model validation and content generation.
    - Updated `.gitignore` to include `.env.google` for environment variable management.
    - Modified `browser_use_cli.py` to dynamically set the model name from environment variables.
    - Improved `get_llm_model` function in `utils.py` to handle model names and API keys more securely using `SecretStr`.
    - Updated tests in `test_browser_cli.py` to set the Google model via environment variable for better test isolation.
    drumnation committed Jan 19, 2025
    Configuration menu
    Copy the full SHA
    c5a5d79 View commit details
    Browse the repository at this point in the history
  4. Add browser tracing and debugging features with enhanced CLI support

    - Introduced a new `pytest_output.txt` file to capture test session logs.
    - Updated `browser_use_cli.py` to manage browser state more effectively and support tracing options.
    - Enhanced `browser-tasks-example.ts` with new tasks for page structure analysis and debugging sessions.
    - Modified `custom_prompts.py` to ensure detailed reporting on page structure.
    - Improved `custom_context.py` to handle tracing and context management.
    - Expanded tests in `test_browser_cli.py` to cover new CLI commands and tracing functionalities.
    drumnation committed Jan 19, 2025
    Configuration menu
    Copy the full SHA
    c8ad372 View commit details
    Browse the repository at this point in the history
  5. Enhance README and add CLI usage guide for browser automation

    - Expanded the README.md to include a detailed purpose for the fork, CLI documentation, example tasks, and configuration options.
    - Introduced a new cli/usage-guide.md file that provides comprehensive instructions on using the browser-use API for automation with various LLM models.
    - Included code snippets for basic setup, browser context configuration, model configuration, agent setup, and common tasks.
    - Added best practices for error handling, resource management, and performance optimization in the usage guide.
    drumnation committed Jan 19, 2025
    Configuration menu
    Copy the full SHA
    0705352 View commit details
    Browse the repository at this point in the history

Commits on Jan 20, 2025

  1. Enhance CLI functionality and README documentation for browser automa…

    …tion
    
    - Added a comprehensive "Quick Start" section and detailed CLI commands to the README.md, improving user guidance for browser automation tasks.
    - Updated `browser_use_cli.py` to support additional options for browser initialization, including headless mode, window size, security settings, user data directory, and proxy configuration.
    - Improved error handling during browser initialization and reinitialization processes to ensure consistent browser state management.
    drumnation committed Jan 20, 2025
    Configuration menu
    Copy the full SHA
    b4a9e3a View commit details
    Browse the repository at this point in the history
  2. Refactor browser state management and enhance CLI options

    - Updated `_get_browser_state` and `_set_browser_state` functions to utilize a temporary file for managing browser state instead of environment variables.
    - Added `--temp-file` argument to `start`, `run`, and `close` commands for specifying the temporary file path.
    - Enhanced `browser-use` script to create and manage a temporary state file during execution.
    - Introduced new options in `browser-tasks-example.ts` for session recording and debugging traces.
    - Updated `browser-use.toolchain.json` to include new parameters for recording and tracing.
    - Added VSCode settings for improved development experience.
    drumnation committed Jan 20, 2025
    Configuration menu
    Copy the full SHA
    8c0118a View commit details
    Browse the repository at this point in the history
  3. Add demo logging feature and enhance structured logging system

    - Introduced a new demo_logging.py file to showcase the logging capabilities of the TaskLogger, including navigation, interaction, and data extraction phases.
    - Enhanced the structured logging system with color-coded log levels, task states, and visual separators for better readability.
    - Implemented error handling improvements with smart retry logic and structured error logging.
    - Added support for detailed performance metrics and progress tracking in task execution.
    - Updated README.md to reflect the new logging features and improvements in the logging system.
    - Introduced new utility functions for structured logging and error handling in the src/utils directory.
    - Added comprehensive tests for the new logging features and error handling mechanisms.
    drumnation committed Jan 20, 2025
    Configuration menu
    Copy the full SHA
    c01bddf View commit details
    Browse the repository at this point in the history
  4. Enhance CLI functionality and update README for improved LLM provider…

    … support
    
    - Refactored CLI commands to allow selection of LLM providers (DeepSeek, Google, OpenAI, Anthropic) with model indexing options.
    - Updated `README.md` to include a comprehensive CLI guide, detailing usage patterns, available models, and environment configuration.
    - Enhanced `browser_use_cli.py` to normalize provider names and select appropriate models based on user input.
    - Modified `browser-use.toolchain.json` to reflect new provider and model options, including descriptions for better user understanding.
    - Improved error handling in model selection and added tests for new provider functionalities in `test_llm_api.py`.
    - Removed obsolete VSCode settings file to streamline project structure.
    drumnation committed Jan 20, 2025
    Configuration menu
    Copy the full SHA
    612587d View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    7d29eba View commit details
    Browse the repository at this point in the history
  6. Enhance browser automation CLI with URL validation and tracing features

    - Added URL validation to the `run_browser_task` function, ensuring only well-formed URLs are accepted.
    - Introduced new command-line options for specifying starting URLs and controlling trace analysis visibility.
    - Updated `README.md` and `browser-use.toolchain.json` to reflect new URL requirements and usage examples.
    - Implemented a new `trace_analyzer.py` module for analyzing browser traces, providing detailed insights into actions and network requests.
    - Added comprehensive tests for URL handling and trace analysis in `test_browser_use_cli.py` and `test_trace_analyzer.py`.
    - Updated `.gitignore` to exclude trace files and added a new `test_results.txt` for logging test outputs.
    drumnation committed Jan 20, 2025
    Configuration menu
    Copy the full SHA
    dedbe69 View commit details
    Browse the repository at this point in the history
Loading