tactus

agent
Security Audit
Pass
Health Pass
  • License — License: Apache-2.0
  • Description — Repository has a description
  • Active repo — Last push 1 days ago
  • Community trust — 273 GitHub stars
Code Pass
  • Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Pass
  • Permissions — No dangerous permissions requested

No AI report is available for this listing yet.

SUMMARY

The first browser AI Agent extension to support Agent Skills, enabling AI to perform complex tasks through an expandable skill system. 🌟 Star if you like it! | 首个支持 Agent Skills 的浏览器 AI Agent 扩展,让 AI 通过可扩展技能系统执行复杂任务 🌟 如果喜欢请点个 Star!

README.md

English | 简体中文

icon Tactus

The First Browser AI Agent Extension with Agent Skills Support

Chrome Version
Chrome Users
Firefox Version
Firefox Users

Tactus is an innovative browser extension that brings the Agent Skills specification to the browser environment, enabling AI Agent to perform complex tasks through an extensible skill system.

Triggering skills can achieve prompt injection for specific scenarios, and common workflows can be encapsulated in scripts for execution, replacing repetitive automated operations of AI Agents. This approach is both fast and token-efficient.

Tactus Demo
Tactus Demo

✨ Core Features

🧩 Agent Skills System

Tactus is the first product to implement the Agent Skills specification in a browser extension:

  • Skill Import - Import spec-compliant Skill folders containing instructions, scripts, and resource files
  • Script Execution - Safely execute JavaScript scripts within pages
  • Trust Mechanism - First-time script execution requires user confirmation, with option to permanently trust

Skills Import Demo

🤖 Intelligent Conversation

  • Multi-Provider Support - Native support for Anthropic (Claude), Gemini (Google), and any OpenAI-compatible API provider
  • Multi-Model Switching - Configure multiple providers and switch models anytime
  • Streaming Response - Real-time AI replies with chain-of-thought display
  • ReAct Paradigm - Built-in complete tool calling loop, AI autonomously decides when to use tools
  • Stop Generation - Interrupt ongoing AI generation or tool calls at any time
  • Tab Lock - Automatically locks the tab during AI responses with a glowing animation to prevent accidental navigation
  • Preset Quick Actions - Quick action buttons for common prompts in the chat panel

🖼️ Image Vision Support

  • Vision Model Config - Configure vision capability independently per model
  • Paste Images - Paste images directly in the chat input to send to AI
  • Multimodal Chat - Support image-text mixed queries, letting AI understand screenshots, charts, and other visual content

Vision Demo

📄 Page Understanding

  • Smart Extraction - Uses Readability + Turndown to extract core page content and convert to Markdown
  • Selection Quote - Select text on page or in sidebar and quote it with one click
  • Context Awareness - AI determines whether to call page extraction tool; skips if skill script provides content
  • Raw Extraction Mode - Configure specific websites to skip Readability algorithm and extract raw page content directly

Page Interaction Demo

🔌 HTTP MCP Support

Connect to external tool servers via Model Context Protocol, extending AI capabilities:

  • MCP Server Connection - Add and manage multiple HTTP MCP Servers
  • Dynamic Tool Discovery - Automatically fetch available tools from MCP Server and integrate into conversations
  • Multiple Auth Methods:
    • No authentication (public services)
    • Bearer Token authentication
    • OAuth 2.1 authentication (with auto-refresh)

https://github.com/user-attachments/assets/c7737e7e-dd2e-4888-a030-db40b9731f1d

🎨 Themes & Personalization

  • Theme Switching - Support light, dark, and system-follow theme modes
  • Floating Ball Toggle - Freely enable/disable the floating ball on the right side of the page
  • Selection Quote Toggle - Freely enable/disable the selection quote feature
  • Internationalization - Support Chinese/English interface switching

💾 Local Storage

  • Session Management - Conversation history stored locally with pagination support
  • Message Editing - Edit sent user messages and regenerate replies
  • Message Copy - One-click copy for AI replies
  • IndexedDB - Skills and files stored in local database
  • Privacy First - All data saved locally, nothing uploaded to any server

⚙️ Advanced Configuration

  • Web Content Char Limit - Configure maximum characters for extracted page content to control token usage
  • Tool Call Limit - Configure maximum tool calls per conversation to prevent infinite loops
  • Smart Base URL Handling - Auto-complete /v1/chat/completions path to simplify API configuration
  • API Endpoint Preview - Preview the full API endpoint URL below the Base URL input
  • Auto-Save Config - Model provider and MCP server configurations are automatically saved with debounce
  • Open Links in New Tabs - Links in AI responses open in new tabs to preserve your conversation

🚀 Quick Start

Download

Browser Download
Chrome Chrome Web Store
Firefox Firefox Add-ons

Or download from GitHub Releases for manual installation.

Install

  • Chrome / Edge (recommended)
    • Extract tactus.zip to a permanent directory
    • Open chrome://extensions/ in Chrome
    • Enable Developer mode (top right)
    • Click Load unpacked (top left)
    • Select the extracted tactus folder
  • Firefox (temporary debug install)
    • Run npm run build:firefox
    • Open about:debugging#/runtime/this-firefox
    • Click Load Temporary Add-on
    • Select .output/firefox-mv2/manifest.json

🛠️ Build from Source

  1. Clone the repository
git clone https://github.com/Castor6/tactus.git
cd tactus
  1. Install dependencies
npm install
  1. Run in development mode
npm run dev
npm run dev:firefox
  1. Build for production
npm run build
npm run build:firefox
  1. Please use the Professional style within the .claude\skills\design-style skills to design with a unified extended UI style.

📖 User Guide

Configure AI Provider

  1. Click the extension icon to open the sidebar
  2. Click the settings button to enter configuration page
  3. Add API provider (fill in name, API URL, key)
  4. Fetch model list and select a model
  5. Enable vision config in model settings if vision capability is needed

Configuration Demo

Import Skill

  1. Find the Skills management area in settings page
  2. Click "Import Skill" button
  3. Select a folder containing SKILL.md
  4. Confirm import and use it in conversations

Configure MCP Server

  1. Find the MCP management area in settings page
  2. Click to add an MCP Server
  3. Fill in Server URL and select authentication method
  4. Once connected, tools provided by MCP will be automatically integrated into conversations

Skill Folder Structure

my-skill/
├── SKILL.md          # Required: Skill definition and instructions
├── scripts/          # Optional: Executable scripts
│   └── fetch-data.js
├── references/       # Optional: Reference documents
│   └── api-docs.md
└── assets/           # Optional: Resource files
    └── template.json

SKILL.md Format

---
name: my-skill
description: This is an example skill
---

# Skill Instructions

When user needs to perform a task, follow these steps:

1. First analyze user requirements
2. Call scripts/fetch-data.js to fetch data
3. Organize and return results

🛠️ Tech Stack

  • Framework: WXT - Modern browser extension development framework
  • Frontend: Vue 3 + TypeScript
  • AI Integration: Native Anthropic (Claude) API, Native Gemini (Google) API, OpenAI SDK (compatible with any OpenAI API)
  • MCP Support: @modelcontextprotocol/sdk - Model Context Protocol client
  • Content Extraction: @mozilla/readability + turndown
  • Storage: IndexedDB (idb) + WXT Storage
  • Authentication: OAuth 2.1 support (for MCP Server auth)

🔧 Built-in Tools

Tactus provides the following built-in tools for AI:

Tool Description
extract_page_content Extract main content from current page
activate_skill Activate a specified Skill
execute_skill_script Execute script within a Skill
read_skill_file Read file content from a Skill
MCP Tools Dynamically fetched tools from connected MCP Servers

📝 Development

Project Structure

tactus/
├── entrypoints/
│   ├── background.ts      # Background script
│   ├── content.ts         # Content script
│   ├── sidepanel/         # Sidebar UI
│   └── options/           # Settings page
├── components/            # Vue components
├── utils/
│   ├── anthropic.ts       # Anthropic (Claude) API integration
│   ├── api.ts             # API calls & tool execution
│   ├── db.ts              # IndexedDB operations
│   ├── gemini.ts          # Gemini (Google) API integration
│   ├── storage.ts         # Settings & storage management
│   ├── i18n.ts            # Internationalization
│   ├── mcp.ts             # MCP Client management
│   ├── mcpOAuth.ts        # MCP OAuth 2.1 auth
│   ├── mcpStorage.ts      # MCP config storage
│   ├── skills.ts          # Skills core logic
│   ├── skillsExecutor.ts  # Script executor
│   ├── skillsImporter.ts  # Skills import
│   ├── tools.ts           # Tool definitions
│   └── pageExtractor.ts   # Page content extraction
└── public/                # Static assets

🤝 Contributing

Issues and Pull Requests are welcome!

🚧 Future Roadmap

We have exciting plans for the future development of Tactus:

  • Introduce CDP automation as a tool for the Agent, allowing manual intervention and takeover.
  • Record operations and generate reusable skills with a single click.
  • Tackle the challenges of long-term stable automation tasks.

📄 License

Apache-2.0 License

⭐ Star History

Star History Chart

Friendship Link

Thanks for the support and feedback from the friends at LINUX DO. Let's make tactus better and better together!


Tactus - granting AI the sense of touch to navigate the web 🚀

Reviews (0)

No results found