💾 Archived View for bbs.geminispace.org › u › Half_Elf_Monk › 21762 captured on 2024-12-17 at 15:41:59. Gemini links have been rewritten to link to archived content

View Raw

More Information

-=-=-=-=-=-=-

Comment by 🌲 Half_Elf_Monk

Re: "post text, not audio"

In: s/permacomputing

I appreciate thinking about this topic, and the comments put here so far. I also like listening to audio, which works well in the workflow (or lack thereof) in a given day. The capacity to generate accurate transcriptions is helpful.

turboscribe.ai blew my mind when I found it. This was a way to generate reasonably accurate text transcription of the audio, and do it relatively easily. I haven't found a way to do this locally, but @requiem mentioned something about Whisper so maybe that's the answer I need.

I don't know why podcasting services couldn't do this automatically for all the podcasts they host/serve. Or why someone couldn't train an AI (using their GPU's), and then distribute the model for others to use on their less-intensive machines. Maybe I'm not understanding the complexity.

I'd be interested to hear @norayr 's thoughts on why that isn't a permacomputing solution. I guess it's not sustainable in a very long run sort of way, but if we train the voice models now, wouldn't we be able to use that model down the road reasonably well? If all of society goes down in a CME or war or something, I have way more important stuff to do than worrying about whether listeners get a transcript of my podcast.

In any case, if anyone is running a local AI to generate good transcripts, please report in with your experience. That sounds very very useful.

🌲 Half_Elf_Monk

Nov 14 · 5 weeks ago

3 Later Comments ↓

🐙 norayr [OP] · Nov 16 at 00:17:

well i think i explained i wasnt right and asked to excuse me, i had some feeling at the point of time which made me write that.

but to sum up

🐙 norayr [OP] · Nov 16 at 00:39:

🌲 Half_Elf_Monk [✝️] · Nov 16 at 21:42:

Ah, thanks @norayr , I think I understand you now. I also would rather the models were freed... from corporate and state dependence. I'm hopeful that such things will exist in the future. Cheers...

Original Post

🌒 s/permacomputing

post text, not audio — publishing audio is convenient, but how to find it on the internet? we even agree on that images should have alt descriptions. otherwise we should rely on ai (which is not lowtech) to find us audio or video files that have the information we search for. p. s. that also relates to 'voice messages' in chats. it is easy to message, but it is not possible later to find the information in the chat log. again, ai may help, but do we want it to help? also, while it is easy...

💬 norayr · 9 comments · Jul 18 · 5 months ago