Releases: ServiceNow/BrowserGym
v0.14.3
What's Changed
- Add "BrowserGym Ecosystem" paper in the readme by @xhluca in #363
- Add WebArena Lite by @amanjaiswal73892 in #368
- update webarenalite requirement and toml by @amanjaiswal73892 in #369
- Version bump dev by @amanjaiswal73892 in #371
- Webarenalite build fix by @amanjaiswal73892 in #374
- Update Sigstore action to version 3.1.0 by @amanjaiswal73892 in #376
- Fix bug in double click by @marksibrahim in #370
- Fix ci by @amanjaiswal73892 in #378
- Add traces section to README by @amanjaiswal73892 in #380
- WebArena Verified by @NicolasAG in #377
- Fix broken links by @NicolasAG in #381
- OpenApps is BrowserGym native | Edits to README by @KarenUllrich in #379
- version-bump 0.14.3 by @amanjaiswal73892 in #382
New Contributors
- @marksibrahim made their first contribution in #370
- @NicolasAG made their first contribution in #377
- @KarenUllrich made their first contribution in #379
Full Changelog: v0.14.2...v0.14.3
Release v0.14.3.dev4
What's Changed
- Add "BrowserGym Ecosystem" paper in the readme by @xhluca in #363
- Add WebArena Lite by @amanjaiswal73892 in #368
- update webarenalite requirement and toml by @amanjaiswal73892 in #369
- Version bump dev by @amanjaiswal73892 in #371
- Webarenalite build fix by @amanjaiswal73892 in #374
- Update Sigstore action to version 3.1.0 by @amanjaiswal73892 in #376
- Fix bug in double click by @marksibrahim in #370
- Fix ci by @amanjaiswal73892 in #378
New Contributors
- @marksibrahim made their first contribution in #370
Full Changelog: v0.14.2...v0.14.3.dev4
v0.14.2
What's Changed
- Miniwob zoom by @recursix in #352
- prevent lxml to go to version 6 by @recursix in #356
- Fix_bbox_scale by @recursix in #357
- Make miniwob installation automatic. by @amanjaiswal73892 in #359
- Fill->type to trigger drop down by @recursix in #358
New Contributors
- @amanjaiswal73892 made their first contribution in #359
Full Changelog: v0.14.1...v0.14.2
v0.14.1: Miniwob zoom (#351)
What's Changed
- Some changes in preparation for agentlab's new ToolUseAgent and the new APIs by @TLSDC in #340
- Miniwob zoom by @recursix in #351
Full Changelog: v0.13.4...v0.14.1
v0.13.3: minor fixes
What's Changed
browsergym-core
- Optional method AbstractBrowserTask.get_task_id() #281
- Fixed BrowserEnv parameter resizeable_window, now working as expected #281
browsergym-experiments
- Metadata column fix for visualwebarena #278
Full Changelog: v0.13.2...v0.13.3
v0.13.2: experiments updates
What's Changed
browsergym-experiments
- Experiment traces can now be exported into the TapeAgents format #238
- Installs weblinx_browsergym as a dependency #261
- WA/VWA full instance reset will only issue a warning instead of crashing if not properly set-up #272
- New debug benchmark visualwebarena_tiny #271
Full Changelog: v0.13.1...v0.13.2
v0.13.1: Many small fixes
What's Changed
browsergym-experiments
browsergym-core
- Fixed gym warnings "obs not within observation space" #251
- Trace downgrades from INFO to DEBUG #252
- More robust env.close(), can now be used in a finally block even after reset failure #253
- Optional AbstractBrowserTask.teardown() method #255
- Browsergym's register_task() now supports both frozen, non-overrideable task_kwargs as well as overrideable default_task_kwargs arguments #255
- More robust frame marking #256 #258
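The distinction between frozen task_kwargs and overrideable default_task_kwargs can be illustrated with a minimal, self-contained sketch. Everything here is an assumption made for illustration: DemoTask, make_task_factory, and the parameter names start_url and viewport_width are hypothetical and are not BrowserGym's actual register_task() implementation or API.

```python
class DemoTask:
    """Hypothetical stand-in for a task class; not part of BrowserGym."""

    def __init__(self, seed, start_url, viewport_width=1280):
        self.seed = seed
        self.start_url = start_url
        self.viewport_width = viewport_width


def make_task_factory(task_class, task_kwargs=None, default_task_kwargs=None):
    """Illustrative helper (an assumption, not BrowserGym's register_task()).

    task_kwargs are frozen: callers may not pass them again.
    default_task_kwargs are pre-set but may be overridden by callers.
    """
    frozen = dict(task_kwargs or {})
    defaults = dict(default_task_kwargs or {})

    # A key cannot be both frozen and a default at the same time.
    clashing = set(frozen) & set(defaults)
    if clashing:
        raise ValueError(f"kwargs given as both frozen and default: {sorted(clashing)}")

    def factory(**overrides):
        # Reject any attempt to override a frozen argument.
        illegal = set(overrides) & set(frozen)
        if illegal:
            raise ValueError(f"cannot override frozen kwargs: {sorted(illegal)}")
        # Later keys win, so caller overrides replace the defaults.
        return task_class(**{**defaults, **overrides, **frozen})

    return factory


factory = make_task_factory(
    DemoTask,
    task_kwargs={"start_url": "https://example.com"},
    default_task_kwargs={"viewport_width": 1280},
)

# Overriding a default is allowed; overriding start_url would raise ValueError.
task = factory(seed=0, viewport_width=800)
```

In this sketch the frozen arguments are applied last and checked explicitly, so a caller gets an error on a forbidden override instead of having it silently ignored.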
browsergym-assistantbench
- Refactored AssistantBench mechanism for saving test predictions to JSON files #242
browsergym-webarena
- Relaxed playwright<1.40 restriction #257
browsergym-visualwebarena
- Relaxed playwright<1.40 restriction #257
Full Changelog
v0.13.0: Minor updates
What's Changed
browsergym-core
- More robust frame marking with lenient last try #245
- Tasks can now choose their own locale and timezone_id #244
browsergym-experiments
- Pre-download WebLINX data in prepare_backend() #226
- Increase AssistantBench max_steps to 30 #244
- Add select_option to webarena / visualwebarena default action set #247
browsergym-visualwebarena
- Hide huggingface progress bar when downloading the visual evaluation model #241
browsergym-assistantbench
- Set locale="en-US" and timezone_id="America/New_York"
Full Changelog: v0.12.0...v0.13.0
v0.12.0: VisualWebarena / WebLINX bugfixes
Bugfixes
browsergym-experiments
- Fixes WebLINX task list #235
- Refactors experiment ID generation #236
- Adds VisualWebArena task dependencies #237 #239
browsergym-visualwebarena
- Fixes VisualWebArena tasks with visual validation (missing captioning_fn in evaluator) #240
- Adds a torch dependency (to run the captioning model) #240
Full Changelog: v0.11.3...v0.12.0
v0.11.3: Minor fixes
Bugfixes
- Fix duplicate depends_on in webarena metadata #228
Improvements
- Easier webarena / visualwebarena setup (running nltk.download() at import time) #227
- More robust full_reset() for webarena / visualwebarena #230
- Removed ARIA extraction warnings #233
- New benchmark configuration webarena_tiny #232
Full Changelog: v0.11.2...v0.11.3