mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-05-07 02:51:50 +00:00
New browser capabilities and a built-in skill for agent-driven web QA. ## New tool: browser_console Returns console messages (log/warn/error/info) AND uncaught JavaScript exceptions in a single call. Uses agent-browser's 'console' and 'errors' commands through the existing session plumbing. Supports --clear to reset buffers. Verified working in both local and Browserbase cloud modes. ## Enhanced tool: browser_vision(annotate=True) New boolean parameter on browser_vision. When true, agent-browser overlays numbered [N] labels on interactive elements — each [N] maps to ref @eN. Annotation data (element name, role, bounding box) returned alongside the vision analysis. Useful for QA reports and spatial reasoning. ## Config: browser.record_sessions Auto-record browser sessions as WebM video files when enabled: - Starts recording on first browser_navigate - Stops and saves on browser_close - Saves to ~/.hermes/browser_recordings/ - Works in both local and cloud modes (verified) - Disabled by default ## Built-in skill: dogfood Systematic exploratory QA testing for web applications. Teaches the agent a 5-phase workflow: 1. Plan — accept URL, create output dirs, set scope 2. Explore — systematic crawl with annotated screenshots 3. Collect Evidence — screenshots, console errors, JS exceptions 4. Categorize — severity (Critical/High/Medium/Low) and category (Functional/Visual/Accessibility/Console/UX/Content) 5. Report — structured markdown with per-issue evidence Includes: - skills/dogfood/SKILL.md — full workflow instructions - skills/dogfood/references/issue-taxonomy.md — severity/category defs - skills/dogfood/templates/dogfood-report-template.md — report template ## Tests 21 new tests covering: - browser_console message/error parsing, clear flag, empty/failed states - browser_console schema registration - browser_vision annotate schema and flag passing - record_sessions config defaults and recording lifecycle - Dogfood skill file existence and content validation Addresses #315.
1.5 KiB
1.5 KiB
Dogfood QA Report
Target: {target_url} Date: {date} Scope: {scope_description} Tester: Hermes Agent (automated exploratory QA)
Executive Summary
| Severity | Count |
|---|---|
| 🔴 Critical | {critical_count} |
| 🟠 High | {high_count} |
| 🟡 Medium | {medium_count} |
| 🔵 Low | {low_count} |
| Total | {total_count} |
Overall Assessment: {one_sentence_assessment}
Issues
Issue #{issue_number}: {issue_title}
| Field | Value |
|---|---|
| Severity | {severity} |
| Category | {category} |
| URL | {url_where_found} |
Description: {detailed_description_of_the_issue}
Steps to Reproduce:
- {step_1}
- {step_2}
- {step_3}
Expected Behavior: {what_should_happen}
Actual Behavior: {what_actually_happens}
Screenshot: MEDIA:{screenshot_path}
Console Errors (if applicable):
{console_error_output}
Issues Summary Table
| # | Title | Severity | Category | URL |
|---|---|---|---|---|
| {n} | {title} | {severity} | {category} | {url} |
Testing Coverage
Pages Tested
- {list_of_pages_visited}
Features Tested
- {list_of_features_exercised}
Not Tested / Out of Scope
- {areas_not_covered_and_why}
Blockers
- {any_issues_that_prevented_testing_certain_areas}
Notes
{any_additional_observations_or_recommendations}