Open Source Directory

Open-Source AI Repos & MCP Servers

Curated directory of trending GitHub repositories, MCP servers, and AI agent skills for developers and researchers.

Updated Daily
Hand-Picked Quality
Free & Open Source

Featured Repositories

View All

Hand-picked open source projects driving AI forward.

openai/openai-agents-python
0

OpenAI's official lightweight Python framework for multi-agent workflows. Released April 2026 as OpenAI's answer to Microsoft Agent Framework and Google ADK. Small surface area, opinionated primitives, first-class tool calls against the OpenAI API. If you already ship on GPT and Responses API, this is the path of least resistance to multi-agent. The SDK ships four primitives: Agents (LLMs with instructions and tools), Handoffs (one agent delegating to another), Guardrails (input and output validation), and Sessions (automatic conversation history). That is the entire mental model. Compare to LangGraph's graph-of-nodes complexity or Microsoft's full stack with DevUI and MCP Apps, and the OpenAI SDK reads like a deliberate reduction. No orchestration DSL, no registry, no runtime UI. Just Python. Early ecosystem response has been strong — the repo has trended on GitHub since launch and is already on PyPI. Tracing is built in with OpenAI's dashboard. MCP support is shipped. Local model compatibility lands through LiteLLM. The bet is that a smaller framework wins developer mindshare because it gets out of the way. The critique: minimalism cuts both ways. There is no built-in DevUI like Microsoft Agent Framework 1.0 ships, no visual tool-call debugger, no LTS commitment. If you need a production-ready agent platform with enterprise governance, you are on your own for observability, deployment, and versioning. For prototyping and for OpenAI-first stacks, it is the cleanest option.

Python
0

Latest MCP Servers

View All

Connect your agents to external tools and data sources.

You know the workflow. Write code locally. Copy into a Colab notebook. Run it for GPU access. Find a bug. Copy back to your editor. Fix. Paste again. Repeat until your sanity erodes.Google Colab MCP Server ends that workflow. Any AI agent that speaks Model Context Protocol — Claude Code, Cursor, Windsurf, Copilot, Gemini CLI — can now control Colab notebooks directly. The agent creates the notebook, writes the cells, installs dependencies, runs the code, and iterates. You watch the notebook being built in real-time.Google's Colab team published the server at github.com/googlecolab/colab-mcp. Apache 2.0 license. Python 3.11+. Free.What It Actually DoesThe server exposes Colab functionality as MCP tools. An agent can:Create new .ipynb files with a specified runtime (CPU, GPU, TPU)Inject markdown cells for documentation and section headersDraft and execute Python code cells, receiving outputs including errors and stack tracesInstall pip dependencies and apt packages in the runtimeRearrange cells for logical flow as the notebook evolvesRead outputs and iterate based on resultsSave and share the completed notebookThe agent writes the notebook the way a human would — incrementally, watching what works, adjusting what does not. The difference is speed. What takes a human 30 minutes of copy-paste-run-debug takes the agent three minutes.InstallationOne command with uv: uvx install colab-mcp. Then configure your MCP client (Claude Desktop, Cursor, Gemini CLI) to point at the installed server. The README has copy-paste configs for each major client.The server authenticates to Colab through your Google account. On first use, you grant the MCP server access to your Drive (where .ipynb files live) and Colab's runtime API. Standard OAuth flow, revocable from your Google account settings.The GPU Access AngleThis is where it gets interesting. Colab provides free GPU runtimes (T4, A100 when available for Pro users). Before this server, your AI agent on your laptop had no practical path to GPU compute for ML workloads. Now it does.A workflow that was impossible yesterday is now trivial: ask Claude Code to train a small PyTorch model on MNIST, iterate on the architecture, and tune hyperparameters. The agent launches a GPU runtime in Colab, runs the training, reads the results, adjusts, and repeats. Your local machine never runs a single training step.Why Google Released ThisGoogle's 2026 strategy has been relentlessly open. The Gemini CLI went open-source. The Android ADK is MCP-compatible. Firebase is publishing MCP integrations. Now Colab. The pattern: Google is betting that becoming the default AI infrastructure for any agent is more valuable than protecting proprietary tooling.This is an opposite bet from Anthropic's Claude-first approach. Anthropic wants agents built on Claude. Google wants Google infrastructure used by agents built on any model. The Colab MCP Server is a clean example: it works with Claude, not just Gemini. Google benefits when anyone's agent runs on Colab's GPUs, not just Gemini-powered agents.Use Cases That Make SenseML experimentation: ask an agent to run a hyperparameter sweep, compare architectures, produce a plot. Colab's free GPU tier handles the compute.Data exploration: hand a CSV to an agent, ask it to 'find the five most interesting patterns.' The agent loads the file into a Colab notebook, runs pandas and seaborn cells, and produces a written summary with supporting visualizations.Rapid prototyping: 'Prototype a semantic search over this document set using sentence-transformers.' The agent installs dependencies, embeds the documents, sets up the search, and produces a working demo. You watch the notebook being built.Reproducible research: agents build notebooks that include the full reasoning and experimentation trail, not just the final code. The notebook is the audit log.LimitationsThe server authenticates to a single Google account at a time. Multi-tenant deployment requires workarounds. Teams that want to share agent-generated notebooks across users face permission complexity.Colab's free tier has compute quotas. Agents that run long training jobs or large hyperparameter sweeps can hit the quota mid-run. The server does not currently queue or retry against the quota wall — the agent has to handle the error gracefully.Runtime state is tied to the notebook's active session. If Colab disconnects (idle timeout, browser close), the runtime variables are lost. The agent can re-execute cells, but any expensive computation is repeated.The server is new — version 0.1.0 at launch. Expect breaking changes over the next few minor versions as Google iterates based on usage.For more MCP servers, browse repos.skila.ai/servers. For AI coding and agent tools, check tools.skila.ai. For articles on Model Context Protocol adoption, visit news.skila.ai.

MCP serverGoogle Colab MCPModel Context Protocol Colab+4
other

n8n-MCP turns your AI coding assistant into an n8n workflow automation expert. It bridges the Model Context Protocol with n8n's entire node ecosystem: 1,396 nodes (812 core + 584 community), 2,700+ workflow templates, and 265 AI-capable tool variants.The practical difference: before n8n-MCP, asking Claude Code to build an n8n workflow was a coin flip. The model would hallucinate node names, invent non-existent parameters, and produce workflows that failed validation. n8n-MCP eliminates that by feeding the AI comprehensive, accurate documentation for every node, operation, and property.The creator, Romuald Czlonkowski, claims it reduces workflow creation time from 45 error-prone minutes to 3 flawless minutes. That number comes from community reports, not formal benchmarks. But the architecture supports it: 99% property coverage, 87% documentation availability, and 2,646 pre-extracted real-world configurations from popular templates.How It WorksThe MCP server exposes n8n documentation as structured tools that AI assistants can query. When you ask Claude Code to "build a workflow that triggers on a new Stripe payment and sends a Slack notification," the server provides exact node configurations, credential schemas, and expression syntax. The AI does not guess. It looks up the correct parameters.A Chat Agent feature lets you describe automations in plain language and receive production-ready workflows. Diff-based updates allow modifying existing workflows without rebuilding from scratch. Real-time monitoring and auto-validation catch errors before deployment.Setup OptionsThe fastest path is the hosted dashboard at dashboard.n8n-mcp.com (free tier: 100 tool calls/day, 5 chat prompts/day). For self-hosting: npx, Docker, Railway, or local installation. The Supporter plan at 19 EUR/month gives 200 daily prompts and unlimited tool calls.Compatible AI ToolsWorks with Claude Code, Claude Desktop, Cursor, Windsurf, VS Code with GitHub Copilot, Gemini CLI, Kiro, Codex, and 15+ other AI development tools. The companion n8n-skills repository provides 7 Claude Code skills for deeper integration.LimitationsOperation coverage sits at 63.6%, meaning some node operations lack full documentationFree tier limits (100 calls/day) may be tight for heavy automation buildersCommunity nodes (584) have less reliable documentation than core nodesThe server provides documentation only. It does not execute workflows or manage n8n instances directly (use n8n-manager-for-ai-agents for that)Critical safety note from the maintainers: never edit production workflows directly with AI. Always test in development environments first and export backups before deployment.For more MCP servers, browse repos.skila.ai/servers. For AI workflow automation tools, check tools.skila.ai. For articles on AI-powered automation, visit news.skila.ai.

MCP n8n serverModel Context Protocoln8n workflow automation+3
other

The official Model Context Protocol filesystem server that gives Claude direct read and write access to files and directories on your local machine or server. Install it once and Claude can open files, create new ones, list directories, search across your codebase, and move or rename files — all without copy-pasting content into the chat window. It ships as part of the official MCP servers repository maintained by Anthropic, making it the most trusted and widely-deployed server in the ecosystem. Because you specify allowed directories at startup, Claude only ever sees what you explicitly permit — it cannot access /etc, your home folder, or any path outside the scope you configure.

filesystemfile-managementmcp+4
81,124
filesystem

Top AI Skills

View All

Equip your agents with specialized capabilities.

An open-source development environment for Claude Code — often called 'oh-my-zsh for Claude Code'. One 5-minute install wires up 11 specialized agents, 33 slash commands, 24 skill workflows, 15 automation hooks, and a 6-layer security-hook system, turning a bare CLI into a pre-wired plan-to-deploy pipeline.

Claude Code skills frameworkClaude Code pluginagent skills+2
intermediatedevelopment

A Claude Code skill suite that stops the AI from faking citations. 22,700 stars. Latest release v3.9.2 ten days ago. CC BY-NC 4.0 — free for academic use, separate license required for commercial. Imbad0202/academic-research-skills is the Claude Code skill suite for the full academic research pipeline. Python (97.4%). Latest release v3.9.2 published May 18, 2026. Trending on GitHub the week of May 21, 2026. CC BY-NC 4.0 — commercial users need a separate license. The Citation-Hallucination Problem It Solves Three university scholars have been sanctioned in 2026 for submitting papers with LLM-fabricated citations to papers that do not exist. The failure mode is well-documented — Zhao et al. 2026 and Lu et al. 2026 are the peer-reviewed studies the maintainer cites in the repo. The pattern: an LLM is asked to find supporting literature, returns plausible-looking citations, and the researcher does not verify them before they ship. By the time the reviewer catches it, the paper is rejected and the scholar's name is in a retraction database. Academic Research Skills attacks the failure mode directly. The reference-hallucination detection stack verifies every cited paper against three independent academic indexes — Semantic Scholar, OpenAlex, Crossref — before the citation lands in the draft. Temporal verification catches LLM-fabricated publication dates. Claim-faithfulness auditing checks that the cited paper actually supports the claim being made. The agent stops being the source of the failure and starts being the layer that defends against it. The Four Coordinated Skills Deep Research. Systematic literature synthesis with systematic-review support. Socratic-dialogue mode for hypothesis development. Cross-index verification across Semantic Scholar, OpenAlex, and Crossref. The output is a literature review the human can defend in a viva. Academic Paper. Multi-agent writing with style calibration that learns the author's voice from prior work. Writing-quality assessment against a published rubric. Format conversion across APA 7.0, Chicago, IEEE, MLA, and Vancouver in one command without breaking citation integrity. Submitting to a new journal becomes a 30-second command instead of a 2-hour reformatting slog. Academic Paper Reviewer. Multi-perspective evaluation with 0-100 quality rubrics. Devil's Advocate adversarial mode that surfaces the criticisms a real peer reviewer will raise. Run your draft through it before you submit, fix what it flags, and your reviewer-response cycle gets shorter. Academic Pipeline. Ten-stage orchestrator that chains the other three skills with integrity gates, claim verification, and a Material Passport that tracks every cited source through the workflow. The orchestrator is what makes the suite a full lit-review-to-submission tool rather than four disconnected scripts. The 'AI Is Your Copilot, Not The Pilot' Philosophy The maintainer's design philosophy is explicit and anti-autonomy. The skills are built to assist a researcher who is driving the work, not to replace the researcher. Human-in-the-loop is a feature, not a fallback. Every citation is auditable. Every claim is verifiable. Every output is reviewable. The pattern is the opposite of the agentic-replacement narrative that has dominated AI marketing for 18 months — the human writes the paper, the skills make the agent's contributions trustworthy enough to ship. This is the right design for academic work. Peer review is unforgiving. A single fabricated citation is enough to torpedo a career. The 'agent does it autonomously, you check at the end' workflow has already produced three sanctioned scholars this year. The right architecture is constrained-agent, audited-output, human-driven — and that is what the suite ships. The Money Math For A PhD Candidate The audience is paying $30,000-60,000/year in tuition. The dissertation is the multi-year deliverable that justifies the entire program. An LLM-fabricated citation flagged in defense or peer review is a career-defining event. Against that downside, a free (for academic use) skill suite that defends against the exact failure mode is the install of the decade. For non-students: the suite also replaces $500-1,500 of paid literature-review software (Covidence, Rayyan, EndNote citation management) and the hours of manual citation-checking that catches LLM fabrications. The integrated workflow is faster than running those tools separately, and the citation-verification quality is at least as good for the canonical-indexed coverage. The Commercial Use Caveat The CC BY-NC 4.0 license is the constraint to be aware of. Free for non-commercial academic use, including PhD students, postdocs, university researchers, unfunded individual research, and most grant-funded academic work. Commercial use — including in-house industry research labs that ship work into commercial products, paid consulting deliverables, or for-profit white papers — requires a separate license from the author. Read the LICENSE file before deploying inside a company. The free academic tier is generous; the commercial restriction is real. What To Do With It This Week If you are a PhD student or postdoc, install it. The install is two commands: /plugin marketplace add Imbad0202/academic-research-skills then /plugin install academic-research-skills. Requires Claude Code v3.7.0+. If you are a technical writer in an industry research lab, read the license first — commercial use needs a separate agreement. If you have ever shipped an LLM-assisted paragraph with a citation you did not personally verify, this is the install that closes that hole. For broader context on the AI-tools-getting-trustworthy pattern, see CodeGraph applying the same human-in-the-loop architecture to coding workflows and Pi Coding Agent as the model-agnostic harness that gives the human authority. The pattern across the stack is consistent — constrain what the AI is allowed to do, hand the human the steering wheel. Our breakdown this week of why the AI-replacement narrative is a sales pitch walks through the data on why this architecture wins.

academic-research-skills-claude-codeclaude-code-skillacademic-ai-writing+2
intermediateresearch

A Claude Code skill that hits 96.7% task success with the skill enabled and 6.7% without. Same benchmark. 14.4x improvement. 22,100 stars. The latest release shipped yesterday. OthmanAdi/planning-with-files is the open-source implementation of the workflow pattern that powered Manus's reported $2 billion acquisition — teach the AI agent to treat the filesystem as memory. The skill is MIT-licensed, cross-compatible with Claude Code, Cursor, Codex, Gemini CLI, and 40+ agents on the agentskills.io open spec. Install with one npx command. The Three Markdown Files That Change The Agent When the skill is active, the agent writes and maintains three files in your repo. task_plan.md is the roadmap — phases, decisions, progress markers, the plan the agent committed to. findings.md is the research log — discovered constraints, dead ends, third-party gotchas, what the agent learned the hard way. progress.md is the completion log — what got done, when, with what result. The skill instructs the agent to reread all three before every decision, log errors back to findings.md for future reference, and verify task completion against task_plan.md. The agent's memory stops being a black box inside its context window and becomes an auditable record in your repo. Why The 14.4x Number Lands The 96.7% versus 6.7% benchmark is real and reproducible — the methodology is published in the repo. Without the skill, the same agent loses the plot halfway through complex tasks, forgets earlier decisions, and rebuilds wrong files from scratch. With the skill, every decision is anchored to a re-read of the plan and the findings. The agent stops drifting. For agencies billing $200/hour that re-prompt failed Claude tasks five times in a session, the math compounds fast. A 14.4x success rate on the same task set means roughly 14x less re-prompt waste — direct savings on Claude or Cursor seat usage. The skill pays for itself in four billable hours and then keeps paying. The Killer Feature: /clear Recovery The most-loved feature in the issues thread is the recovery layer. When your Claude Code context window fills and you /clear the session, the agent normally loses everything. With planning-with-files, the markdown files in your repo serve as the recovery scaffold. The next prompt rereads task_plan.md, findings.md, and progress.md, and the agent resumes where it left off — with all the prior decisions and discovered constraints intact. This solves the single most frustrating failure mode of long-horizon agent work. You no longer lose three hours of agent context because you hit a token limit on a Tuesday afternoon. Installation Takes Twelve Seconds Run npx skills add OthmanAdi/planning-with-files --skill planning-with-files -g and the global skill is live. The latest release v2.43.0 shipped on May 26, 2026 — one day before this listing — so the maintainer cadence is genuinely active. MIT license, no enterprise license needed for commercial use. What To Do With It This Week If you run any multi-step coding work through Claude Code or Cursor longer than a single afternoon, install it. If your team is paying Manus money and would prefer to keep that budget in-house, install it. If you have ever screamed at your AI agent for forgetting what it agreed to do twenty minutes earlier, install it. For broader context on the AI-agents-getting-real-memory trend, see related coverage of Anthropic's Project Glasswing grounding security agents in real partner code, and the Claude product line driving the agentic playbook the skill is built on top of.

planning-with-files-claude-code-skillclaude-code-skillmanus-workflow+2
beginnerdevelopment

Weekly AI Digest