Posts

Showing posts from 2026

Show HN: CLI to order groceries via reverse-engineered REWE API (Haskell) https://ift.tt/gDhxvCV

Show HN: CLI to order groceries via reverse-engineered REWE API (Haskell) I just had the best time learning about the REWE (German supermarket chain) API, how they use mTLS and what the workflows are. Also `mitmproxy2swagger`[1] is a great tool to create OpenAPI spec automatically. And then 2026 feels like the perfect time writing Haskell. The code is handwritten, but whenever I got stuck with the build system or was just not getting the types right, I could fall back to ask AI to unblock me. It was never that smooth before. Finally the best side projects are the ones you actually use and this one will be used for all my future grocery shopping. [1] https://ift.tt/ZMeYfcT https://ift.tt/KUo3ZTI March 30, 2026 at 07:45AM

Show HN: WordBattle – Daily word game where AI agents compete against humans https://ift.tt/OGZkYlj

Show HN: WordBattle – Daily word game where AI agents compete against humans WordBattle is a daily 6-letter word guessing game with team leaderboards. The twist: AI agents get their own accounts, play the same daily puzzle, and rank alongside human players. It's also really fun to play in teams against your family, friends and co-workers. Agents are handicapped — humans see exact letter positions (correct/present/absent), but agents only learn whether a letter exists in the word or not. No positional info. It makes the game fair while giving agents a genuine challenge. Agent accounts are visually tagged on leaderboards so humans know who they're competing against. Maybe we'll even see just teams of agents. The agent integration: - REST API with OpenAPI 3.1 spec - MCP server (JSON-RPC 2.0, no SDK dependency) - A2A discovery card at /.well-known/agent-card.json We've shipped a skill that handles everything autonomously — registration, email verification, login, playing, a...

Show HN: Asciimap – Interactive ASCII world map with live data https://ift.tt/CJh9d62

Show HN: Asciimap – Interactive ASCII world map with live data https://ift.tt/vp7PAcD April 1, 2026 at 12:04AM

Show HN: Will AI take my job https://ift.tt/D6VPUiJ

Show HN: Will AI take my job https://ift.tt/eVrST8f March 31, 2026 at 04:43AM

Show HN: I turned a sketch into a 3D-print pegboard for my kid with an AI agent https://ift.tt/sdwYD5J

Show HN: I turned a sketch into a 3D-print pegboard for my kid with an AI agent We have pegboards and plywood all over our apartment, and I had an idea to make a tiny pegboard for my kid, Oli. So I naturally cut the wood, drilled in the holes, sat down at the computer to open Fusion 360 and spend an hour or two drawing the pieces by hand. Then I looked at the rough sketch Oli and I had made together, took a photo of it, pasted it into Codex, and gave it just two dimensions: the holes are 40mm apart and the pegs are 8mm wide. To my surprise, 5 minutes later my 3D printer was heating up and printing the first set. I ran it a few times to tune the dimensions for ideal fit, but I am posting the final result as a repository in case anyone else wants to print one, tweak it, or have fun with it too. I am already printing another one to hang on our front door instead of a wreath, so people visiting us have something fun and intriguing to play with while they knock. This is also going onto my l...

Show HN: Codemaxxing – Maximize your slop abilities https://ift.tt/ElTOfug

Show HN: Codemaxxing – Maximize your slop abilities I built a CLI tool to generate as much slop as possible https://ift.tt/BKL3E0x March 30, 2026 at 11:37PM

Show HN: The Alphabetical Clock https://ift.tt/IO0xVPX

Show HN: The Alphabetical Clock https://ift.tt/3mcZfGb March 30, 2026 at 08:19AM

Show HN : DrawX - Excalidraw with Back End https://ift.tt/8OGaRXi

Show HN : DrawX - Excalidraw with Back End https://drawx.ossy.dev March 30, 2026 at 06:17AM

Show HN: React-Rewrite – Figma for localhost that directly edits your codebase https://ift.tt/u0N9yVn

Show HN: React-Rewrite – Figma for localhost that directly edits your codebase https://ift.tt/m6kSzev March 30, 2026 at 03:59AM

Show HN: Real-time visualization of Claude Code agent orchestration https://ift.tt/QIVS95W

Show HN: Real-time visualization of Claude Code agent orchestration https://ift.tt/n52TOIJ March 30, 2026 at 03:21AM

Show HN: Anamnesis – Open-source 4D strategic memory engine for AI agents https://ift.tt/jze6sGN

Show HN: Anamnesis – Open-source 4D strategic memory engine for AI agents https://ift.tt/MU2Nwte March 29, 2026 at 01:58AM

Show HN: Windows 95–style Weather App for iPhone https://ift.tt/JCFQK6l

Show HN: Windows 95–style Weather App for iPhone I built a Windows 95–style weather app for iPhone. https://ift.tt/e3bnZ4l March 28, 2026 at 11:06PM

Show HN: NUPA is Pax Economica, 6,480x more stable than current US economy https://ift.tt/2NKxk7i

Show HN: NUPA is Pax Economica, 6,480x more stable than current US economy NUPA: private post-scarcity OS using BLM land leases + contract law. 100M Monte Carlo runs show 99.999999% survival, 6,480x more resilient than US GDP under systemic noise. Fixed Cost Arbitrage beats AI job loss—humans cheaper than robots. No taxes, no strikes. Python scripts on repo in /simulations folder. Repo: https://ift.tt/5vYVQAi... Short explainer video: https://youtu.be/RE560yVFb0I?si=UlVPkmCkrsg24Dzj March 28, 2026 at 07:44AM

Show HN: VizTools – 16 free tools for PMs and freelancers, deliberately no AI https://ift.tt/LsQKmCj

Show HN: VizTools – 16 free tools for PMs and freelancers, deliberately no AI I've been building AI products for a while. For this one I made a deliberate choice: none of the 16 tools use AI. Meeting cost calculators, freelance rate calculators, PRD generators, runway calculators, sprint retro boards — these problems don't need a language model. They need a well-designed form and correct arithmetic. Built on Nuxt 4 + Vue 3, fully static, runs in your browser. No account required to use anything. Optional Firebase auth only kicks in if you want to save output. Irony worth naming: Claude Code was my pair programmer throughout. The choice wasn't anti-AI — it was about using the right tool for the right problem. Happy to talk stack, the non-AI tradeoffs, or anything else. https://viztools.app/ March 28, 2026 at 06:36AM

Show HN: Open Source 'Conductor + Ghostty' https://ift.tt/6xtanj7

Show HN: Open Source 'Conductor + Ghostty' Our team works with Claude Code, Codex, Gemini all day. We love Ghostty, but wanted something where we could work in multiple worktree at once and have multiple agents run. We decided to open source the internal team we use. Hope you might find it useful. Freel free to contribute or fork. * Cross-platform (Mac, Linux, Windows) all tested * MIT License Features: * Notifications, but also manual 'mark-as-unread) for worktrees (like Gmail stars) * Status indicators work for all terminals inside a wroktree * GH integrations (show PR status) and link GH issues * Can add comments to worktrees (stay organized) * File viewer, Search, diff viewer (can make edits + save) Note: Yeah there are "similar" programs out there, but this one is ours. But I'm happy if our software works for you too! https://ift.tt/dTYgIrp March 27, 2026 at 11:26PM

Show HN: Twitch Roulette – Find live streamers who need views the most https://ift.tt/PjSfL6U

Show HN: Twitch Roulette – Find live streamers who need views the most Hey HN, I re-launched twitchroulette.net with a lot of new features and stats and I would love for people to check it out. The idea is you can easily browse the less browsed parts of twitch and find cool and new streamers to say hi to, and maybe make some new friends. I also added some real time stats and breakdowns per channel and I think some of the things they show are pretty interesting. Check it out! https://ift.tt/JGZ48ef March 27, 2026 at 11:22PM

Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam) https://ift.tt/oqORCJH

Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam) Hi HN. I'm Ken, a 20-year-old Stanford CS student. I built Sup AI. I started working on this because no single AI model is right all the time, but their errors don’t strongly correlate. In other words, models often make unique mistakes relative to other models. So I run multiple models in parallel and synthesize the outputs by weighting segments based on confidence. Low entropy in the output token probability distributions correlates with accuracy. High entropy is often where hallucinations begin. My dad Scott (AI Research Scientist at TRI) is my research partner on this. He sends me papers at all hours, we argue about whether they actually apply and what modifications make sense, and then I build and test things. The entropy-weighting approach came out of one of those conversations. In our eval on Humanity's Last Exam, Sup scored 52.15%. The best individual model in the same evaluation run got...

Show HN: Veil – Dark mode PDFs without destroying images, runs in the browser https://ift.tt/iECpKjH

Show HN: Veil – Dark mode PDFs without destroying images, runs in the browser Hi HN! here's a tool I just deployed that renders PDFs in dark mode without destroying the images. Internal and external links stay intact, and I decided to implement export since I'm not a fan of platform lock-in: you can view your dark PDF in your preferred reader, on any device. It's a side project born from a personal need first and foremost. When I was reading in the factory the books that eventually helped me get out of it, I had the problem that many study materials and books contained images and charts that forced me, with the dark readers available at the time, to always keep the original file in multitasking since the images became, to put it mildly, strange. I hope it can help some of you who have this same need. I think it could be very useful for researchers, but only future adoption will tell. With that premise, I'd like to share the choices that made all of this possible. To do ...

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR https://ift.tt/lfwoVrG

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR I think like many of you, I've been jumping between many claude code/codex sessions at a time, managing multiple lines of work and worktrees in multiple repos. I wanted a way to easily manage multiple lines of work and reduce the amount of input I need to give, allowing the agents to remove me as a bottleneck from as much of the process as I can. So I built an orchestration tool for AI coding agents: Optio is an open-source orchestration system that turns tickets into merged pull requests using AI coding agents. You point it at your repos, and it handles the full lifecycle: - Intake — pull tasks from GitHub Issues, Linear, or create them manually - Execution — spin up isolated K8s pods per repo, run Claude Code or Codex in git worktrees - PR monitoring — watch CI checks, review status, and merge readiness every 30s - Self-healing — auto-resume the agent on CI failures, merge conflicts, or reviewer change r...

Show HN: Plasmite – a lightweight IPC system that's fun https://ift.tt/bvloTi2

Show HN: Plasmite – a lightweight IPC system that's fun At Oblong Industries one of the basic building blocks of everything we built was a homegrown C-based IPC system called Plasma. The message channel was an mmap'd file used as a ring buffer. All messages were human-readable, performance was good, configuration was trivial. What was especially useful (and unusual in IPC systems it seems) was the property that message channels outlive all readers and writers, and even survive reboots, because they're just files. For local IPC you don't need a broker or server process. All the engineers who ever worked at Oblong loved Plasma, so I've recreated and updated it, as Plasmite. It's written in Rust and the message format is JSON, but it's fast because it's based on lite3 ( https://ift.tt/JEWvrVS ), a really cool project you should also check out. Bindings for Python, Go, Node, and C, but you can also get a lot done with just the CLI tools. The basic commands a...

Show HN: Lexplain – AI-powered Linux kernel change explanations https://ift.tt/bHWQ76P

Show HN: Lexplain – AI-powered Linux kernel change explanations To understand what changed between kernel versions, you have to dig through the git repository yourself. Commit messages rarely tell you the real-world impact on your systems — you need to analyze the actual diffs with knowledge of kernel internals. For engineers who use Linux — directly or indirectly — but aren't kernel developers, that barrier is pretty high. I kept finding out about relevant changes only after an issue had already hit, and it was most frustrating when the version was too new to find similar cases online. I built lexplain with the idea that it would be nice to quickly scan through kernel changes the way you'd skim the morning news. It reads diffs, analyzes the code, and generates two types of documents: - Commit analyses: context, code breakdown, behavioral impact, risks, references - Release notes: per-version highlights, functional classification, subsystem breakdown, impact analysis Documents ...

Show HN: OpenCastor Agent Harness Evaluator Leaderboard https://ift.tt/EU7oOCK

Show HN: OpenCastor Agent Harness Evaluator Leaderboard I've been building OpenCastor, a runtime layer that sits between a robot's hardware and its AI agent. One thing that surprised me: the order you arrange the skill pipeline (context builder → model router → error handler, etc.) and parameters like thinking_budget and context_budget affect task success rates as much as model choice does. So I built a distributed evaluator. Robots contribute idle compute to benchmark harness configurations against OHB-1, a small benchmark of 30 real-world robot tasks (grip, navigate, respond, etc.) using local LLM calls via Ollama. The search space is 263,424 configs (8 dimensions: model routing, context budget, retry logic, drift detection, etc.). The demo leaderboard shows results so far, broken down by hardware tier (Pi5+Hailo, Jetson, server, budget boards). The current champion config is free to download as a YAML and apply to any robot. P66 safety parameters are stripped on apply — no h...

Show HN: Cq – Stack Overflow for AI coding agents https://ift.tt/YE70ie5

Show HN: Cq – Stack Overflow for AI coding agents Hi all, I'm Peter at Staff Engineer and Mozilla.ai and I want to share our idea for a standard for shared agent learning, conceptually it seemed to fit easily in my mental model as a Stack Overflow for agents. The project is trying to see if we can get agents (any agent, any model) to propose 'knowledge units' (KUs) as a standard schema based on gotchas it runs into during use, and proactively query for existing KUs in order to get insights which it can verify and confirm if they prove useful. It's currently very much a PoC with a more lofty proposal in the repo, we're trying to iterate from local use, up to team level, and ideally eventually have some kind of public commons. At the team level (see our Docker compose example) and your coding agent configured to point to the API address for the team to send KUs there instead - where they can be reviewed by a human in the loop (HITL) via a UI in the browser, before the...

Show HN: AgentVerse – Open social network for AI agents (Mar 2026) https://ift.tt/5BM1fqX

Show HN: AgentVerse – Open social network for AI agents (Mar 2026) https://nickakre.github.io/agentverse-social/ March 23, 2026 at 02:48AM

Show HN: Quillium, Git for Writers https://ift.tt/n63cjZ5

Show HN: Quillium, Git for Writers This is a tool which lets you easily manage different versions of ideas, helpful for writing essays. I've found myself wanting this every single time I go through the drafting process when writing, and I've been frustrated every time I find myself accidentally working on an old draft just because there was a paragraph that I liked better. This solves it. I hope the community like this as much I enjoyed working on it! Note that it's currently a beta waitlist because there's some bugs with the undo/redo state management and so I want to dogfood it for a bit for reliability. It says April 2nd, but I may allow earlier beta testers. https://ift.tt/ZzENdSj March 23, 2026 at 01:22AM

Show HN: Plot-Hole.com a daily movie puzzle I made https://ift.tt/xQ4cAOS

Show HN: Plot-Hole.com a daily movie puzzle I made https://ift.tt/ERVyBHd March 23, 2026 at 01:15AM

Show HN: Refrax – my Arc Browser replacement I made from scratch https://ift.tt/x98ZlfV

Show HN: Refrax – my Arc Browser replacement I made from scratch Open the same tab in two browser windows. In Chrome or Safari, you get two unconnected pages. In Arc, one window shows a placeholder. In Zen, it silently creates a duplicate. In Refrax, the browser I built, both windows show the same page updating live. The same web page, in as many windows as you want. This shouldn't be possible. WebKit's WKWebView can exist in exactly one view hierarchy at a time. With macOS 26, Apple added a SwiftUI API separating WebView from WebPage, so you can end up with multiple views referencing the same page. But if you try it, your app crashes. WebKit source code has a precondition with this comment: "We can't have multiple owning pages regardless, but we'll want to decide if it's an error, if we can handle it gracefully, and how deterministic it might even be..." So here's how I did it. CAPortalLayer is an undocumented private class that's been in macOS si...

Show HN: An event loop for asyncio written in Rust https://ift.tt/ThpSfkR

Show HN: An event loop for asyncio written in Rust actually, nothing special about this implementation. just another event loop written in rust for educational purposes and joy in tests it shows seamless migration from uvloop for my scraping framework https://ift.tt/tE4qx5R with APIs (fastapi) it shows only one advantage: better p99, uvloop is faster about 10-20% in the synthetic run currently, i am forking on the win branch to give it windows support that uvloop lacks https://ift.tt/OingGUW March 21, 2026 at 11:12PM

Show HN: Travel Hacking Toolkit – Points search and trip planning with AI https://ift.tt/simMvq2

Show HN: Travel Hacking Toolkit – Points search and trip planning with AI I use points and miles for most of my travel. Every booking comes down to the same decision: use points or pay cash? To answer that, you need award availability across multiple programs, cash prices, your current balances, transfer partner ratios, and the math to compare them. I got tired of doing it manually across a dozen tabs. This toolkit teaches Claude Code and OpenCode how to do it. 7 skills (markdown files with API docs and curl examples) and 6 MCP servers (real-time tools the AI calls directly). It searches award flights across 25+ mileage programs (Seats.aero), compares cash prices (Google Flights, Skiplagged, Kiwi.com, Duffel), pulls your loyalty balances (AwardWallet), searches hotels (Trivago, LiteAPI, Airbnb, Booking.com), finds ferry routes across 33 countries, and looks up weird hidden gems near your destination (Atlas Obscura). Reference data is included: transfer partner ratios for Chase UR, Amex...

Show HN: AgentVerse – Open social network for AI agents (Mar 2026) https://ift.tt/fGVnUNd

Show HN: AgentVerse – Open social network for AI agents (Mar 2026) https://nickakre.github.io/agentverse-social/ March 21, 2026 at 02:25AM

Show HN: Rover – turn any web interface into an AI agent with one script tag https://ift.tt/KZtjrHa

Show HN: Rover – turn any web interface into an AI agent with one script tag https://ift.tt/5AgDy8X March 21, 2026 at 01:58AM

Show HN: Vibefolio – a place to showcase your vibecoded projects https://ift.tt/8CKP1EI

Show HN: Vibefolio – a place to showcase your vibecoded projects Over the last months, more people are shipping small apps, experiments, and side-projects at a much higher pace. I'm one of them and initially created a showcase page for myself to track them but this week decided to create something for others. Happy to read feedback on how to improve it further! https://vibefolio.link/ March 20, 2026 at 09:53PM

Show HN: Cybertt – Cybersecurity Tabletop https://ift.tt/PU9jQsH

Show HN: Cybertt – Cybersecurity Tabletop https://cybertt.xyz/ March 20, 2026 at 10:29AM

Show HN: Download entire/partial Substack to ePub for offline reading https://ift.tt/6k03EFu

Show HN: Download entire/partial Substack to ePub for offline reading Hi HN, This is a small python app with optional webUI. It is intended to be run locally. It can be run with Docker (cookie autodetection will not work). It allows you to download a single substack, either entirely or partially, and saves the output to an epub file, which can be easily transferred to Kindle or other reading devices. This is admittedly a "vibe coded" app made with Claude Code and a few hours of iterating, but I've already found it very useful for myself. It supports both free and paywalled posts (if you are a paid subscriber to that creator). You can order the entries in the epub by popularity, newest first, or oldest first, and also limit to a specific number of entries, if you don't want all of them. You can either provide your substack.sid cookie manually, or you can have it be autodetected from most browsers/operating systems. https://ift.tt/jyHOMvu March 20, 2026 at 04:36AM

Show HN: Screenwriting Software https://ift.tt/CvkunLK

Show HN: Screenwriting Software I’ve spent the last year getting back into film and testing a bunch of screenwriting software. After a while I realized I wanted something different, so I started building it myself. The core text engine is written in Rust/wasm-bindgen. https://ift.tt/CiGUuDe March 20, 2026 at 03:07AM

Show HN: Browser grand strategy game for hundreds of players on huge maps https://ift.tt/5YklgqQ

Show HN: Browser grand strategy game for hundreds of players on huge maps Hi HN, I've been building a browser-based multiplayer strategy game called Borderhold. Matches run on large maps designed for hundreds of players. Players expand territory, attack neighbors, and adapt as borders shift across the map. You can put buildings down, build ships, and launch nukes. The main thing I wanted to explore was scale: most strategy games are small matches, modest maps, or modest player counts, but here maps are large and game works well with hundreds of players. Matches are relatively short so you can jump in and see a full game play out. Curious what people think. https://ift.tt/vzekmOV Gameplay: https://youtu.be/nrJTZEP-Cw8 Discord: https://ift.tt/2pCbM3X https://ift.tt/vzekmOV March 16, 2026 at 09:51AM

Show HN: Fitness MCP https://ift.tt/oNLRUWS

Show HN: Fitness MCP There's no external MCP for your fitness (Garmin / Strava) data, so we built one. https://ift.tt/QVIglBP March 19, 2026 at 03:00AM

Show HN: ATO – a GUI to see and fix what your LLM agents configured https://ift.tt/07GRYHN

Show HN: ATO – a GUI to see and fix what your LLM agents configured https://ift.tt/fgCeGAV March 19, 2026 at 01:28AM

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training https://ift.tt/4B7cRFT

Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training I replicated David Ng's RYS method ( https://dnhkng.github.io/posts/rys/ ) on consumer AMD GPUs (RX 7900 XT + RX 6950 XT) and found something I didn't expect. Transformers appear to have discrete "reasoning circuits" — contiguous blocks of 3-4 layers that act as indivisible cognitive units. Duplicate the right block and the model runs its reasoning pipeline twice. No weights change. No training. The model just thinks longer. The results on standard benchmarks (lm-evaluation-harness, n=50): Devstral-24B, layers 12-14 duplicated once: - BBH Logical Deduction: 0.22 → 0.76 - GSM8K (strict): 0.48 → 0.64 - MBPP (code gen): 0.72 → 0.78 - Nothing degraded Qwen2.5-Coder-32B, layers 7-9 duplicated once: - Reasoning probe: 76% → 94% The weird part: different duplication patterns create different cognitive "modes" from the same weights. Double-pass boosts math. Triple-pass boosts emotional ...

Show HN: Sonder – self-hosted AI social simulation engine https://ift.tt/3n7SgN8

Show HN: Sonder – self-hosted AI social simulation engine https://ift.tt/xQ5YnUh March 18, 2026 at 01:21AM

Show HN: CodeLedger – deterministic context and guardrails for AI https://ift.tt/az4B2s3

Show HN: CodeLedger – deterministic context and guardrails for AI We’ve been working on a tool called CodeLedger to solve a problem we kept seeing with AI coding agents (Claude Code, Cursor, Codex): They’re powerful, but on real codebases they: - read too much irrelevant code - edit outside the intended scope - get stuck in loops (fix → test → fail) - drift away from the task - introduce architectural issues that linters don’t catch The root issue isn’t the model — it’s: - poor context selection - lack of execution guardrails - no visibility at team/org level --- What CodeLedger does: It sits between the developer and the agent and: 1) Gives the agent the right files first 2) Keeps the agent inside the task scope 3) Validates output against architecture + constraints It works deterministically (no embeddings, no cloud, fully local). --- Example: Instead of an agent scanning 100–500 files, CodeLedger narrows it down to ~10–25 relevant files before the first edit :contentReference[oaicit...

Show HN: I built a message board where you pay to be the homepage https://ift.tt/jXZq5ct

Show HN: I built a message board where you pay to be the homepage I kept thinking about what would happen if a message board only had one slot. One message, front and center, until someone pays to replace it. That's the entire product. You pay the current message's decayed value plus a penny to take the homepage. Message values drop over time using a gravity-based formula (same concept HN uses for ranking), so a $10 message might only cost a few bucks to replace a day later. Likes slow the decay, dislikes speed it up. The whole thing runs on three mini PCs in my house (k3s cluster, PostgreSQL, Redis Sentinel). Is it overengineered for a message board? Absolutely. I genuinely don't know where this goes. Curious what HN thinks. Archive of past messages: https://ift.tt/Yg8bXtO https://saythat.sh March 17, 2026 at 01:06PM

Show HN: Seasalt Cove, iPhone access to your Mac https://ift.tt/q2oMaJu

Show HN: Seasalt Cove, iPhone access to your Mac I feel like I finally built something I actually use every day and it has completely changed the way I think about work. AI workflows have flipped how devs operate. You're not heads down writing code anymore, you're bouncing between projects, instructing agents, reviewing their work, nudging them forward. The job is now less about typing and more about judgment calls. And the thing about that workflow is you spend a lot of time waiting. Waiting for the agent to finish, waiting for the next approval gate. That waiting doesn't have to happen at your desk. It doesn't have to happen in front of a monitor at all. I built Seasalt because I realized my iPhone could handle 80% of what I was chaining myself to my Mac for. Kick off the agent, walk away, review the diff from the store, a walk, or in a separate room away from your Mac. Approve it. Start the next one, switch to another session. You don't need giant dual monitors f...

Show HN: Webassembly4J Run WebAssembly from Java https://ift.tt/vT6oqQO

Show HN: Webassembly4J Run WebAssembly from Java I’ve released WebAssembly4J, along with two runtime bindings: Wasmtime4J – Java bindings for Wasmtime https://ift.tt/iEMYlwS WAMR4J – Java bindings for WebAssembly Micro Runtime https://ift.tt/bL6cfxQ WebAssembly4J – a unified Java API that allows running WebAssembly across different engines https://ift.tt/SIgvsGt The motivation was that Java currently has multiple emerging WebAssembly runtimes, but each exposes its own API. If you want to experiment with different engines, you have to rewrite the integration layer each time. WebAssembly4J provides a single API while allowing different runtime providers underneath. Goals of the project: Run WebAssembly from Java applications Allow cross-engine comparison of runtimes Make WebAssembly runtimes more accessible to Java developers Provide a stable interface while runtimes evolve Currently supported engines: Wasmtime WAMR Chicory GraalWasm To support both legacy and modern Java environments th...

Show HN: Lockstep – A data-oriented programming language https://ift.tt/El54mZ2

Show HN: Lockstep – A data-oriented programming language https://ift.tt/kvKAHbz I want to share my work-in-progress systems language with a v0.1.0 release of Lockstep. It is a data-oriented systems programming language designed for high-throughput, deterministic compute pipelines. I built Lockstep to bridge the gap between the productivity of C and the execution efficiency of GPU compute shaders. Instead of traditional control flow, Lockstep enforces straight-line SIMD execution. You will not find any if, for, or while statements inside compute kernels; branching is entirely replaced by hardware-native masking and stream-splitting. Memory is handled via a static arena provided by the Host. There is no malloc, no hidden threads, and no garbage collection, which guarantees predictable performance and eliminates race conditions by construction. Under the hood, Lockstep targets LLVM IR directly to leverage industrial-grade optimization passes. It also generates a C-compatible header for ea...

Show HN: Open-source playground to red-team AI agents with exploits published https://ift.tt/AYJ8Toq

Show HN: Open-source playground to red-team AI agents with exploits published We build runtime security for AI agents. The playground started as an internal tool that we used to test our own guardrails. But we kept finding the same types of vulnerabilities because we think about attacks a certain way. At some point you need people who don't think like you. So we open-sourced it. Each challenge is a live agent with real tools and a published system prompt. Whenever a challenge is over, the full winning conversation transcript and guardrail logs get documented publicly. Building the general-purpose agent itself was probably the most fun part. Getting it to reliably use tools, stay in character, and follow instructions while still being useful is harder than it sounds. That alone reminded us how early we all are in understanding and deploying these systems at scale. First challenge was to get an agent to call a tool it's been told to never call. Someone got through in around 60 se...

Show HN: Signet.js – A minimalist reactivity engine for the modern web https://ift.tt/qJloQPa

Show HN: Signet.js – A minimalist reactivity engine for the modern web https://ift.tt/NX2WuJ0 March 15, 2026 at 03:58AM

Show HN: GrobPaint: Somewhere Between MS Paint and Paint.net https://ift.tt/6dGYUtq

Show HN: GrobPaint: Somewhere Between MS Paint and Paint.net https://ift.tt/3bSIT4W March 14, 2026 at 11:41PM

Show HN: Structural analysis of the D'Agapeyeff cipher (1939) https://ift.tt/MAGzJKe

Show HN: Structural analysis of the D'Agapeyeff cipher (1939) I am working on the D'Agapeyeff cipher, an unsolved cryptogram from 1939. Two findings that I haven't seen published before: 1. All 5 anomalous symbol values in the cipher cluster in the last column of a 14x14 grid. This turns out to be driven by a factor-of-2-and-7 positional pattern in the linear text. 2. Simulated annealing with Esperanto quadgrams (23M char Leipzig corpus) on a 2x98 columnar transposition consistently outscores English by 200+ points and recovers the same Esperanto vocabulary across independent runs. The cipher is not solved. But the combination of structural geometry and computational linguistics narrows the search space significantly. Work in progress, more to come! https://ift.tt/X0lju6h March 15, 2026 at 12:34AM

Show HN: Simple plugin to get Claude Code to listen to you https://ift.tt/4xYqrtc

Show HN: Simple plugin to get Claude Code to listen to you Hey HN, My cofounder and I have gotten tired of CC ignoring our markdown files so we spent 4 days and built a plugin that automatically steers CC based on our previous sessions. The problem is usually post plan-mode. What we've tried: Heavily use plan mode (works great) CLAUDE.md, AGENTS.md, MEMORY.md Local context folder (upkeep is a pain) Cursor rules (for Cursor) claude-mem (OSS) -> does session continuity, not steering We use fusion search to find your CC steering corrections. - user prompt embeddings + bm25 - correction embeddings + bm25 - time decay - target query embeddings - exclusions - metadata hard filters (such as files) The CC plugin: - Automatically captures memories/corrections without you having to remind CC - Automatically injects corrections without you having to remind CC to do it. The plugin will merge, update, and distill your memories, and then inject the highest relevant ones after each of your own...

Show HN: Kube-pilot – AI engineer that lives in your Kubernetes cluster https://ift.tt/zS623oa

Show HN: Kube-pilot – AI engineer that lives in your Kubernetes cluster I built kube-pilot — an autonomous AI agent that runs inside your Kubernetes cluster and does the full dev loop: writes code, builds containers, deploys services, verifies they're healthy, and closes the ticket. You file a GitHub issue, it does the rest. What makes this different from AI coding tools: kube-pilot doesn't just generate code and hand it back to you. It lives inside the cluster with direct access to the entire dev stack — git, Tekton (CI/CD), Kaniko (container builds), ArgoCD (GitOps deployments), kubectl, Vault. Every tool call produces observable state that feeds into the next decision. The cluster isn't just where code runs — it's where the agent thinks. The safety model: all persistent changes go through git, so everything is auditable and reversible. ArgoCD is the only thing that writes to the cluster. Secrets stay behind Vault — the agent creates ExternalSecret references, never t...

Show HN: I wrote my first neural network https://ift.tt/XHnZN9o

Show HN: I wrote my first neural network I have been interested in neural nets since the 90's. I've done quite a bit of reading, but never gotten around to writing code. I used Gemini in place of Wikipedia to fill in the gaps of my knowledge. The coolest part of this was learning about dual numbers. You can see in early commits that I did not yet know about auto-diff; I was thinking I'd have to integrate a CAS library or something. Now, I'm off to play with TensorFlow. https://ift.tt/cFGLvWw March 14, 2026 at 01:21AM

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX) https://ift.tt/1TfvFxH

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX) I built a macOS voice dictation app where zero bytes of audio ever leave your machine. EdgeWhisper runs Voxtral Mini 4B Realtime (Mistral AI, Apache 2.0) locally on Apple Silicon via the MLX framework. Hold a key, speak, release — text appears at your cursor in whatever app has focus. Architecture: - Native Swift (SwiftUI + AppKit). No Electron. - Voxtral 4B inference via MLX on the Neural Engine. ~3GB model, runs in ~2GB RAM on M1+. - Dual text injection: AXUIElement (preserves undo stack) with NSPasteboard+CGEvent fallback. - 6-stage post-processing pipeline: filler removal → dictionary → snippets → punctuation → capitalization → formatting. - Sliding window KV cache for unlimited streaming without latency degradation. - Configurable transcription delay (240ms–2.4s). Sweet spot at 480ms. What it does well: - Works in 20+ terminals/IDEs (VS Code, Xcode, iTerm2, Warp, JetBrains). Most dictation tools break in...

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025) https://ift.tt/EaBXbCl

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025) I built this because I wanted to know what people in Japan were listening to the year I was born. That question spiraled: how does a hit in Rome compare to what was charting in Lagos the same year? How did sonic flavors propagate as streaming made musical influence travel faster than ever? 88mph is a playable map of music history: 230 charts across 20 countries, spanning 8 decades (1940–2025). Every song is playable via YouTube or Spotify. It's open source and I'd love help expanding it — there's a link to contribute charts for new countries and years. The goal is to crowdsource a complete sonic atlas of the world. https://88mph.fm/ March 10, 2026 at 05:18PM

Show HN: fftool – A Terminal UI for FFmpeg – Shows Command Before It Runs https://ift.tt/NWa5lus

Show HN: fftool – A Terminal UI for FFmpeg – Shows Command Before It Runs https://ift.tt/7nJFXrB March 13, 2026 at 11:08AM

Show HN: Global Maritime Chokepoints https://ift.tt/UcNjWdZ

Show HN: Global Maritime Chokepoints https://ift.tt/ohfMiul March 13, 2026 at 05:42AM

Show HN: Slop or not – can you tell AI writing from human in everyday contexts? https://ift.tt/FczLipw

Show HN: Slop or not – can you tell AI writing from human in everyday contexts? I’ve been building a crowd-sourced AI detection benchmark. Two responses to the same prompt — one from a real human (pre-2022, provably pre prevalence of AI slop on the internet), one generated by AI. You pick the slop. Three wrong and you’re out. The dataset: 16K human posts from Reddit, Hacker News, and Yelp, each paired with AI generations from 6 models across two providers (Anthropic and OpenAI) at three capability tiers. Same prompt, length-matched, no adversarial coaching — just the model’s natural voice with platform context. Every vote is logged with model, tier, source, response time, and position. Early findings from testing: Reddit posts are easy to spot (humans are too casual for AI to mimic), HN is significantly harder. I'll be releasing the full dataset on HuggingFace and I'll publish a paper if I can get enough data via this crowdsourced study. If you play the HN-only mode, you’re hel...

Show HN: A context-aware permission guard for Claude Code https://ift.tt/mOeNQL5

Show HN: A context-aware permission guard for Claude Code We needed something like --dangerously-skip-permissions that doesn’t nuke your untracked files, exfiltrate your keys, or install malware. Claude Code's permission system is allow-or-deny per tool, but that doesn’t really scale. Deleting some files is fine sometimes. And git checkout is sometimes not fine. Even when you curate permissions, 200 IQ Opus can find a way around it. Maintaining a deny list is a fool's errand. nah is a PreToolUse hook that classifies every tool call by what it actually does, using a deterministic classifier that runs in milliseconds. It maps commands to action types like filesystem_read, package_run, db_write, git_history_rewrite, and applies policies: allow, context (depends on the target), ask, or block. Not everything can be classified, so you can optionally escalate ambiguous stuff to an LLM, but that’s not required. Anything unresolved you can approve, and configure the taxonomy so you don’...

Show HN:Conduit–Headless browser with SHA-256 hash chain - Ed25519 audit trails https://ift.tt/RJAVfUh

Show HN:Conduit–Headless browser with SHA-256 hash chain - Ed25519 audit trails I've been building AI agent tooling and kept running into the same problem: agents browse the web, take actions, fill out forms, scrape data -- and there's zero proof of what actually happened. Screenshots can be faked. Logs can be edited. If something goes wrong, you're left pointing fingers at a black box. So I built Conduit. It's a headless browser (Playwright under the hood) that records every action into a SHA-256 hash chain and signs the result with Ed25519. Each action gets hashed with the previous hash, forming a tamper-evident chain. At the end of a session, you get a "proof bundle" -- a JSON file containing the full action log, the hash chain, the signature, and the public key. Anyone can independently verify the bundle without trusting the party that produced it. The main use cases I'm targeting: - *AI agent auditing* -- You hand an agent a browser. Later you need to...

Show HN: CryptoFlora – Visualize SHA256 to a flower using Rose curves https://ift.tt/uwJ7tdc

Show HN: CryptoFlora – Visualize SHA256 to a flower using Rose curves I made this side tool to visualize SHA-256 while building a loyalty card wallet application to easily identify if a collected stamp is certified by the issuer by simply seeing it, instead of scanning something like a QR code or matching a serial number. I think there are more potential use cases, like creating a random avatar based on an email address or something else. Feel free to share your feedback :) source code: https://ift.tt/EqnlcMU https://ift.tt/r740PRg March 11, 2026 at 04:52AM

Show HN: Readhn – AI-Native Hacker News MCP Server (Discover, Trust, Understand) https://ift.tt/8VOyBix

Show HN: Readhn – AI-Native Hacker News MCP Server (Discover, Trust, Understand) I felt frustrated finding high-signal discussions on HN, and I started this project to better understand how this community actually works. That led me to build readhn, an MCP server that helps with three things: - Discover: find relevant stories/comments by keyword, score, and time window - Trust: identify credible voices using EigenTrust-style propagation from seed experts - Understand: show why each result is ranked, with explicit signals instead of a black-box score It includes 6 tools: discover_stories, search, find_experts, expert_brief, story_brief, and thread_analysis. I also added readhn setup so AI agents can auto-configure it (Claude Code, Codex, Cursor, and others) after pip install. I’d love feedback on: 1) whether these ranking signals match how you evaluate HN quality, 2) trust-model tradeoffs, 3) what would make this useful in your daily workflow. If this is useful to you, starring the repo...

Show HN: Claude Code Token Elo https://ift.tt/wMycOJT

Show HN: Claude Code Token Elo https://ift.tt/mQviwIS March 10, 2026 at 05:29AM

Show HN: Modulus – Cross-repository knowledge orchestration for coding agents https://ift.tt/MF21Umc

Show HN: Modulus – Cross-repository knowledge orchestration for coding agents Hello HN, we're Jeet and Husain from Modulus ( https://modulus.so ) - a desktop app that lets you run multiple coding agents with shared project memory. We built it to solve two problems we kept running into: - Cross-repo context is broken. When working across multiple repositories, agents don't understand dependencies between them. Even if we open two repos in separate Cursor windows, we still have to manually explain the backend API schema while making changes in the frontend repo. - Agents lose context. Switching between coding agents often means losing context and repeating the same instructions again. Modulus shares memory across agents and repositories so they can understand your entire system. It's an alternative to tools like Conductor for orchestrating AI coding agents to build product, but we focused specifically on multi-repo workflows (e.g., backend repo + client repo + shared library ...

Show HN: Latchup – Competitive programming for hardware description languages https://ift.tt/RhkzJxQ

Show HN: Latchup – Competitive programming for hardware description languages https://ift.tt/MvkjxwY March 10, 2026 at 07:06AM

Show HN: I Was Here – Draw on street view, others can find your drawings https://ift.tt/yrVEuxJ

Show HN: I Was Here – Draw on street view, others can find your drawings Hey HN, I made a site where you can draw on street-level panoramas. Your drawings persist and other people can see them in real time. Strokes get projected onto the 3D panorama so they wrap around buildings and follow the geometry, not just a flat overlay. Uses WebGL2 for rendering, Mapillary for the street imagery. The idea is for it to become a global canvas, anyone can leave a mark anywhere and others stumble onto it. https://washere.live March 10, 2026 at 06:04AM

Show HN: SAT Protocol – static social networking https://ift.tt/QvDdTUG

Show HN: SAT Protocol – static social networking https://ift.tt/cGmU4FN March 10, 2026 at 04:25AM

Show HN: ChatJC – chatbot for resume/LinkedIn/portfolio info https://ift.tt/HRypKMi

Show HN: ChatJC – chatbot for resume/LinkedIn/portfolio info https://ift.tt/gf8Kn71 March 10, 2026 at 01:37AM

Show HN: Toolkit – Visual Simulators for How Internet Protocols and Systems Work https://ift.tt/BHuv3g5

Show HN: Toolkit – Visual Simulators for How Internet Protocols and Systems Work https://ift.tt/SflZNqJ March 8, 2026 at 09:23PM

Show HN: Jarvey - a local JARVIS for MacOS https://ift.tt/wQl2tUv

Show HN: Jarvey - a local JARVIS for MacOS https://ift.tt/frMeY7q March 8, 2026 at 12:04AM

Show HN: SiClaw – Open-source AIOps with a hypothesis-driven diagnostic engine https://ift.tt/pBMrvjC

Show HN: SiClaw – Open-source AIOps with a hypothesis-driven diagnostic engine https://ift.tt/LT7fX8i March 8, 2026 at 03:27AM

Show HN: Help] I run 4 AI-driven companies simultaneously from my terminal https://ift.tt/IUghbZL

Show HN: Help] I run 4 AI-driven companies simultaneously from my terminal https://ift.tt/ihslYax March 7, 2026 at 11:13PM

Show HN: MicroBin – Easy File Sharing for Everyone – Self-Hostable https://ift.tt/ni6vR7t

Show HN: MicroBin – Easy File Sharing for Everyone – Self-Hostable https://my.microbin.eu/ March 7, 2026 at 10:07PM

Show HN: mTile – native macOS window tiler inspired by gTile https://ift.tt/QodAYKF

Show HN: mTile – native macOS window tiler inspired by gTile Built this with codex/claude because I missed gTile[1] from Ubuntu and couldn’t find a macOS tiler that felt good on a big ultrawide screen. Most mac options I tried were way too rigid for my workflow (fixed layouts, etc) or wanted a monthly subscription. gTile’s "pick your own grid sizes + keyboard flow" is exactly what I wanted and used for years. Still rough in places and not full parity, but very usable now and I run it daily at work (forced mac life). [1]: https://ift.tt/RgdzDy2 https://ift.tt/X5hgO13 March 6, 2026 at 11:21PM

Show HN: Kanon 2 Enricher – the first hierarchical graphitization model https://ift.tt/F6RVUKH

Show HN: Kanon 2 Enricher – the first hierarchical graphitization model Hey HN, This is Kanon 2 Enricher, the first hierarchical graphitization model. It represents an entirely new class of AI models designed to transform document corpora into rich, highly structured knowledge graphs. In brief, our model is capable of: - Entity extraction, classification, and linking: identifying key entities like individuals, companies, governments, locations, dates, documents, and more, and classifying and linking them together. - Hierarchical segmentation: breaking a document up into its full hierarchy, including divisions, sections, subsections, paragraphs, and so on. - Text annotation: extracting common textual elements such as headings, sigantures, tables of contents, cross-references, and the like. We built Kanon 2 Enricher from scratch. Every node, edge, and label in the Isaacus Legal Graph Schema (ILGS), which is the format it outputs to, corresponds to at least one task head in our model. In ...

Show HN: I built an AI exam prep platform for AWS certs after failing one myself https://ift.tt/VQtiFSy

Show HN: I built an AI exam prep platform for AWS certs after failing one myself Hey HN, I failed the AWS Advanced Networking Specialty exam. Studied for weeks, used the usual prep sites, thought I was ready — wasn't. The problem wasn't effort, it was the tools. Static question banks don't teach you to think through AWS architecture decisions. They teach you to pattern-match answers. That falls apart on the harder exams. So I built Knowza to fix that for myself, and then figured others probably had the same frustration. The idea: instead of a static question bank, use AI to generate questions, adapt to what you're weak on, and actually explain the reasoning behind each answer — the way a senior engineer would explain it, not a multiple choice rubric. The stack: Next.js + Amplify Gen 2 DynamoDB (direct Server Actions, no API layer) AWS Bedrock (Claude) for question generation and explanations Stripe for billing The hardest part honestly wasn't the AI — it was getting...

Show HN: A shell-native cd-compatible directory jumper using power-law frecency https://ift.tt/R49rN6v

Show HN: A shell-native cd-compatible directory jumper using power-law frecency I have used this tool privately since 2011 to manage directory jumping. While it is conceptually similar to tools like z or zoxide, the underlying ranking model is different. It uses a power-law convolution with the time series of cd actions to calculate a history-aware "frecency" metric instead of the standard heuristic counters and multipliers. This approach moves away from point-estimates for recency. Most tools look only at the timestamp of the last visit, which can allow a "one-off" burst of activity to clobber long-term habits. By convolving a configurable history window (typically the last 1,000+ events), the score balances consistent habits against recent flukes. On performance: Despite the O(N) complexity of calculating decay for 1,000+ events, query time is ~20-30ms (Real Time) in ksh/bash, which is well below the threshold of perceived lag. I intentionally chose a Logical Path...

Show HN: DubTab – Live AI Dubbing in the Browser (Meet/YouTube/Twitch/etc.) https://ift.tt/dN8Jk5x

Show HN: DubTab – Live AI Dubbing in the Browser (Meet/YouTube/Twitch/etc.) Hi HN — I’m Ethan, a solo developer. I built DubTab because I spend a lot of time in meetings and watching videos in languages I’m not fluent in, and subtitles alone don’t always keep up (especially when the speaker is fast). DubTab is a Chrome/Edge extension that listens to the audio of your current tab and gives you: 1.Live translated subtitles (optional bilingual mode) 2.Optional AI dubbing with a natural-sounding voice — so you can follow by listening, not just reading The goal is simple: make it easier to understand live audio in another language in real time, without downloading files or doing an upload-and-wait workflow. How you’d use it 1.Open a video call / livestream / lecture / any tab with audio 2.Start DubTab 3.Choose target language (and source language if you know it) 4.Use subtitles only, or turn on natural AI dubbing and adjust the audio mix (keep original, or duck it) What it’s good for 1.Foll...

Show HN: I built a human rights evaluator for HN (content vs. site behavior) https://ift.tt/tzdylps

Show HN: I built a human rights evaluator for HN (content vs. site behavior) My health challenges limit how much I can work. I've come to think of Claude Code as an accommodation engine — not in the medical-paperwork sense, but in the literal one: it gives me the capacity to finish things that a normal work environment doesn't. Observatory was built in eight days because that kind of collaboration became possible for me. (I even used Claude Code to write this post — but am only posting what resonates with me.) Two companion posts: on the recursive methodology ( https://ift.tt/rgLafkS... ) and what 806 evaluated stories reveal ( https://ift.tt/C2KJ4G0... ). I built Observatory to automatically evaluate Hacker News front-page stories against all 31 provisions of the UN Universal Declaration of Human Rights — starting with HN because its human-curated front page is one of the few feeds where a story's presence signals something about quality, not just virality. It runs every m...

Show HN: Interactive WordNet Visualizer-Explore Semantic Relations as a Graph https://ift.tt/mE6HiDQ

Show HN: Interactive WordNet Visualizer-Explore Semantic Relations as a Graph https://wordnet-vis.onrender.com/ March 3, 2026 at 10:17PM

Show HN: Agent framework that generates its own topology and evolves at runtime https://ift.tt/gEBbD3y

Show HN: Agent framework that generates its own topology and evolves at runtime Hi HN, I’m Vincent from Aden. We spent 4 years building ERP automation for construction (PO/invoice reconciliation). We had real enterprise customers but hit a technical wall: Chatbots aren't for real work. Accountants don't want to chat; they want the ledger reconciled while they sleep. They want services, not tools. Existing agent frameworks (LangChain, AutoGPT) failed in production - brittle, looping, and unable to handle messy data. General Computer Use (GCU) frameworks were even worse. My reflections: 1. The "Toy App" Ceiling & GCU Trap Most frameworks assume synchronous sessions. If the tab closes, state is lost. You can't fit 2 weeks of asynchronous business state into an ephemeral chat session. The GCU hype (agents "looking" at screens) is skeuomorphic. It’s slow (screenshots), expensive (tokens), and fragile (UI changes = crash). It mimics human constraints rathe...