Tactus

The First Browser AI Agent Extension with Agent Skills Support

Tactus is an innovative browser extension that brings the Agent Skills specification to the browser environment, enabling AI Agent to perform complex tasks through an extensible skill system.

Triggering skills can achieve prompt injection for specific scenarios, and common workflows can be encapsulated in scripts for execution, replacing repetitive automated operations of AI Agents. This approach is both fast and token-efficient.

Tactus Demo

✨ Core Features

🧩 Agent Skills System

Tactus is the first product to implement the Agent Skills specification in a browser extension:

Skill Import - Import spec-compliant Skill folders containing instructions, scripts, and resource files
Script Execution - Safely execute JavaScript scripts within pages
Trust Mechanism - First-time script execution requires user confirmation, with option to permanently trust

Skills Import Demo

🤖 Intelligent Conversation

Multi-Provider Support - Native support for Anthropic (Claude), Gemini (Google), and any OpenAI-compatible API provider
Multi-Model Switching - Configure multiple providers and switch models anytime
Streaming Response - Real-time AI replies with chain-of-thought display
ReAct Paradigm - Built-in complete tool calling loop, AI autonomously decides when to use tools
Stop Generation - Interrupt ongoing AI generation or tool calls at any time
Tab Lock - Automatically locks the tab during AI responses with a glowing animation to prevent accidental navigation
Preset Quick Actions - Quick action buttons for common prompts in the chat panel

🖼️ Image Vision Support

Vision Model Config - Configure vision capability independently per model
Paste Images - Paste images directly in the chat input to send to AI
Multimodal Chat - Support image-text mixed queries, letting AI understand screenshots, charts, and other visual content

Vision Demo

📄 Page Understanding

Smart Extraction - Uses Readability + Turndown to extract core page content and convert to Markdown
Selection Quote - Select text on page or in sidebar and quote it with one click
Context Awareness - AI determines whether to call page extraction tool; skips if skill script provides content
Raw Extraction Mode - Configure specific websites to skip Readability algorithm and extract raw page content directly

Page Interaction Demo

🔌 HTTP MCP Support

Connect to external tool servers via Model Context Protocol, extending AI capabilities:

MCP Server Connection - Add and manage multiple HTTP MCP Servers
Dynamic Tool Discovery - Automatically fetch available tools from MCP Server and integrate into conversations
Multiple Auth Methods:
- No authentication (public services)
- Bearer Token authentication
- OAuth 2.1 authentication (with auto-refresh)

https://github.com/user-attachments/assets/c7737e7e-dd2e-4888-a030-db40b9731f1d

🎨 Themes & Personalization

Theme Switching - Support light, dark, and system-follow theme modes
Floating Ball Toggle - Freely enable/disable the floating ball on the right side of the page
Selection Quote Toggle - Freely enable/disable the selection quote feature
Internationalization - Support Chinese/English interface switching

💾 Local Storage

Session Management - Conversation history stored locally with pagination support
Message Editing - Edit sent user messages and regenerate replies
Message Copy - One-click copy for AI replies
IndexedDB - Skills and files stored in local database
Privacy First - All data saved locally, nothing uploaded to any server

⚙️ Advanced Configuration

Web Content Char Limit - Configure maximum characters for extracted page content to control token usage
Tool Call Limit - Configure maximum tool calls per conversation to prevent infinite loops
Smart Base URL Handling - Auto-complete /v1/chat/completions path to simplify API configuration
API Endpoint Preview - Preview the full API endpoint URL below the Base URL input
Auto-Save Config - Model provider and MCP server configurations are automatically saved with debounce
Open Links in New Tabs - Links in AI responses open in new tabs to preserve your conversation

🚀 Quick Start

Download

Browser	Download
Chrome	Chrome Web Store
Firefox	Firefox Add-ons

Or download from GitHub Releases for manual installation.

Install

Chrome / Edge (recommended)
- Extract tactus.zip to a permanent directory
- Open chrome://extensions/ in Chrome
- Enable Developer mode (top right)
- Click Load unpacked (top left)
- Select the extracted tactus folder
Firefox (temporary debug install)
- Run npm run build:firefox
- Open about:debugging#/runtime/this-firefox
- Click Load Temporary Add-on
- Select .output/firefox-mv2/manifest.json

🛠️ Build from Source

Clone the repository

git clone https://github.com/Castor6/tactus.git
cd tactus

Install dependencies

npm install

Run in development mode

npm run dev
npm run dev:firefox

Build for production

npm run build
npm run build:firefox

Please use the Professional style within the .claude\skills\design-style skills to design with a unified extended UI style.

📖 User Guide

Configure AI Provider

Click the extension icon to open the sidebar
Click the settings button to enter configuration page
Add API provider (fill in name, API URL, key)
Fetch model list and select a model
Enable vision config in model settings if vision capability is needed

Configuration Demo

Import Skill

Find the Skills management area in settings page
Click "Import Skill" button
Select a folder containing SKILL.md
Confirm import and use it in conversations

Configure MCP Server

Find the MCP management area in settings page
Click to add an MCP Server
Fill in Server URL and select authentication method
Once connected, tools provided by MCP will be automatically integrated into conversations

Skill Folder Structure

my-skill/
├── SKILL.md          # Required: Skill definition and instructions
├── scripts/          # Optional: Executable scripts
│   └── fetch-data.js
├── references/       # Optional: Reference documents
│   └── api-docs.md
└── assets/           # Optional: Resource files
    └── template.json

SKILL.md Format

---
name: my-skill
description: This is an example skill
---

# Skill Instructions

When user needs to perform a task, follow these steps:

1. First analyze user requirements
2. Call scripts/fetch-data.js to fetch data
3. Organize and return results

🛠️ Tech Stack

Framework: WXT - Modern browser extension development framework
Frontend: Vue 3 + TypeScript
AI Integration: Native Anthropic (Claude) API, Native Gemini (Google) API, OpenAI SDK (compatible with any OpenAI API)
MCP Support: @modelcontextprotocol/sdk - Model Context Protocol client
Content Extraction: @mozilla/readability + turndown
Storage: IndexedDB (idb) + WXT Storage
Authentication: OAuth 2.1 support (for MCP Server auth)

🔧 Built-in Tools

Tactus provides the following built-in tools for AI:

Tool	Description
`extract_page_content`	Extract main content from current page
`activate_skill`	Activate a specified Skill
`execute_skill_script`	Execute script within a Skill
`read_skill_file`	Read file content from a Skill
MCP Tools	Dynamically fetched tools from connected MCP Servers

📝 Development

Project Structure

tactus/
├── entrypoints/
│   ├── background.ts      # Background script
│   ├── content.ts         # Content script
│   ├── sidepanel/         # Sidebar UI
│   └── options/           # Settings page
├── components/            # Vue components
├── utils/
│   ├── anthropic.ts       # Anthropic (Claude) API integration
│   ├── api.ts             # API calls & tool execution
│   ├── db.ts              # IndexedDB operations
│   ├── gemini.ts          # Gemini (Google) API integration
│   ├── storage.ts         # Settings & storage management
│   ├── i18n.ts            # Internationalization
│   ├── mcp.ts             # MCP Client management
│   ├── mcpOAuth.ts        # MCP OAuth 2.1 auth
│   ├── mcpStorage.ts      # MCP config storage
│   ├── skills.ts          # Skills core logic
│   ├── skillsExecutor.ts  # Script executor
│   ├── skillsImporter.ts  # Skills import
│   ├── tools.ts           # Tool definitions
│   └── pageExtractor.ts   # Page content extraction
└── public/                # Static assets

🤝 Contributing

Issues and Pull Requests are welcome!

🚧 Future Roadmap

We have exciting plans for the future development of Tactus:

Introduce CDP automation as a tool for the Agent, allowing manual intervention and takeover.
Record operations and generate reusable skills with a single click.
Tackle the challenges of long-term stable automation tasks.

📄 License

Apache-2.0 License

⭐ Star History

Friendship Link

Thanks for the support and feedback from the friends at LINUX DO. Let's make tactus better and better together!

Tactus - granting AI the sense of touch to navigate the web 🚀