Back to News
Advertisement
AAG342 1 day ago 22 commentsRead Article on traceapp.info

ZH version is available. Content is displayed in original English for accuracy.

I'm the developer of Trace, a non-intrusive, shortcut-driven Mac app that records and transcribes your meetings on-device. I know, another meeting transcription app. Please bear with me though, I'm confident that this is at least a little novel.

I primarily built Trace for myself. I'd been using MacWhisper, but there was enough fiddling before each call that I'd forget to start it and walk out of an hour-long meeting with nothing written down. So the things I cared about most were that it's quick to activate and stays out of the way. You activate Trace by pressing a global shortcut (configurable), which reveals a small bar at the bottom of your screen (there's also a keystroke and/or option to hide it entirely if you'd rather not see it at all).

As I was building it I wanted to bake in a couple of workflows I'd wished for in other transcription apps.

1. Mid-meeting you can press another global shortcut to mark a "key moment" and type a note. The note shows up in the resulting transcript inline at that timestamp. I wanted to add this because I kept catching myself thinking "wait, that bit matters" in meetings and reaching to jot it down in a separate app like Obsidian, which I then needed to add context to, which took me out of the meeting. I use it all the time. If I paste the transcript into an LLM afterwards (which I find myself doing more and more these days) the important moments are flagged so it doesn't gloss over them. This is more noticeable in longer meetings with lots of topics. 2. With another keyboard shortcut you can summon a rough live recap (subtitles, basically) to quickly recap what's just been said.

Trace uses standard macOS microphone and system recording APIs to capture both sides of the conversation as two separate tracks and then runs the system side through on-device diarization to identify speakers. Right now we only label them as "Speaker 1", "Speaker 2", etc but there are plans for speaker labelling in the future. You can also show a "live recap" as the call is happening to review what someone just said.

All transcription models run on your machine. To be clear though, Trace doesn't do any of the summarising itself, it just produces a markdown transcript, so if you want summaries then you need to pass the output to an AI.

The app is sandboxed and your audio/transcripts are never uploaded anywhere - they just exist as audio files and markdown on disk. The only network call Trace is required to make is on the first run to download the speech and speaker models (around 500MB) from Hugging Face, and after that it can be used fully offline. If enabled, a Google Calendar integration can auto-name sessions but that needs a network connection.

The app is £9.99 on the macOS App Store. I've been using it every day for months now and I'm super happy with how it's improved my workflow. Feedback very welcome.

Advertisement

⚡ Community Insights

Discussion Sentiment

75% Positive

Analyzed from 554 words in the discussion.

Trending Topics

#apps#don#source#large#fluidaudio#open#nice#macwhisper#used#https

Discussion (22 Comments)Read Original on HackerNews

blopkerabout 2 hours ago
Nice! I really like how many variations on this idea are coming out. MacWhisper used to be great, but is kinda of a buggy mess now.

I'm making my own, for personal use. I did a survey of many and they all (that I could find) skip the fundamentals.

The major issues that I've run into:

- Crash recovery. Most of these apps are incredibly buggy and crash all the time, taking the recorded audio with them. Macwhisper is incredibly bad at this.

- Disk space. Many of these apps save wav files to disk. After a few hours of meetings, you may end up with gigabytes eaten.

- Microphone bleed. People don't always use headphones, the system mic will pick up the speaker sounds, causing duplicate (approximately) transcriptions.

I've yet to find a solution that handles all these correctly, let alone having high quality transcriptions.

Anyway, most of these apps are built around https://github.com/FluidInference/FluidAudio, if anyone is curious. Their readme has a big list of similar apps as well.

highmastdonabout 1 hour ago
I’m using MacParakeet these days. If your language is supported, definitely give it a try. It’s much faster and lower footprint
jv22222about 2 hours ago
Nice tip on FluidAudio that's the kind of thing I've been looking for. Thanks!
robertkarl14 minutes ago
This looks sick. I was going to download it but for $10 I am more willing to attempt asking Claude to implement something like it, than to purchase.

I would be more willing to purchase if it was open source and I could build from source to try it first.

denbycabout 2 hours ago
I'd love to have a purchase option not tied to the App Store if possible. I don't use an Apple account with my Mac, but I would love to try Trace.
mushufasaabout 3 hours ago
This looks like a good approach, though I would expect this to be a native macOs feature within 12 months -- this seems totally like it fits into their product roadmap.
nkmnzabout 2 hours ago
Which Speech-to-Text is used? Is it possible to configure it? This might be crucial for supporting languages other than English - the model that comes built-in with macOS fails completely for German.
frabiaabout 3 hours ago
Super interesting! How accurate is the local model to transcribe audio compared to other cloud services? E.g. Google Meet, Otter, Granola, etc.
watchlightabout 2 hours ago
A lot of the available models are Whisper or Faster-Whisper derived and shared across multiple apps. The tier names are often funny... "Tiny" "base" "small" "medium" "large" "large-v2" "large-v3" "large-v3-turbo" -en only variants, etc.

In my experience, medium is often the sweet spot for English accuracy vs speed, especially if following-up with a post-processing pass. The large options are all fine, but can severely slow it down. There are some speed checks on my website if you're curious (link not posted because I don't want to hijack another post's app).

watchlightabout 6 hours ago
Agreed with JohnBiz, the moment flagging is interesting and unusual, and a nice contrast to passive transcription. I only recently learned about MacWhisper (I'm Windows primarily) and was floored to learn how expensive the Pro option is. Nowadays it's not so hard to have some-level of DIY transcription, so crazy that it's priced with a premium.

What's your diarization pipeline? Pyannote?

I'd taken a different approach that used a LLM clean-up pass to summarize and progressively compress the transcript for ultra-long content, but I like the idea of targeted "pay attention here" flags.

nazcaabout 3 hours ago
I've been looking for this exact thing!
overflowyabout 3 hours ago
Does it support multiple languages?
satvikpendemabout 2 hours ago
I don't see how this is different to literally the dozens of other offline transcription apps, many open source even unlike this one.
hmokiguessabout 2 hours ago
can you share them? I'm looking for a decent open source one
infl8edabout 1 hour ago
I don't mind https://matthartman.github.io/ghost-pepper/ however I do really want speaker recognition which it does have but I haven't been able to get it working.
jv22222about 2 hours ago
vermilinguaabout 2 hours ago
I don’t see any there that are as focused as this one, perhaps except Talat which is considerably more expensive.
hmokiguessabout 1 hour ago
I went through the list but most feel subpar to me, and some aren't even open source (just claim they use FluidAudio I guess?)
jv22222about 2 hours ago
Classic HN. Thanks for keeping it real.
satvikpendem17 minutes ago
There are so many I've seen on show HN, that's why.