💾 Archived View for gemi.dev › gemini-mailing-list › 000841.gmi captured on 2023-11-04 at 13:08:52. Gemini links have been rewritten to link to archived content

-=-=-=-=-=-=-

[users] Language tagging does not always tell the truth

Stephane Bortzmeyer <stephane (a) sources.org>

I was happy to see our first capsule in Chinese, but the language
tagging ('zh-TW') is misleading: all the texts are in
English. Strange.


Petite Abeille <petite.abeille (a) gmail.com>



> On Mar 30, 2021, at 09:03, Stephane Bortzmeyer <stephane at sources.org> wrote:
> 
> but the language tagging ('zh-TW') is misleading, all the texts are in english

franc: detect the language of text

https://github.com/wooorm/franc/tree/main/packages/franc

# '/usr/local/bin/franc' --ignore glg,vec --min-length 256 < '04.content.utf.txt' 2>/dev/null

?0?
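
A check like the one above could be automated by comparing the declared tag with the detector's output. The following is only a minimal sketch, assuming the capsule declares its language through the lang parameter of the text/gemini response header, and that the detector reports ISO 639-3 codes as franc does. The function names and the small ISO_639_3 table are hypothetical and cover only the two languages discussed in this thread.

```python
# Illustrative mapping from a BCP 47 primary subtag to ISO 639-3 codes.
# Assumption: only the languages mentioned in this thread are listed.
ISO_639_3 = {
    "zh": {"zho", "cmn"},
    "en": {"eng"},
}


def declared_langs(header):
    """Return the language tags declared in a Gemini response header line."""
    status, _, meta = header.strip().partition(" ")
    if not status.startswith("2"):
        return []  # only success responses carry a MIME type
    for param in meta.split(";")[1:]:
        name, _, value = param.partition("=")
        if name.strip().lower() == "lang":
            tags = value.strip().strip('"').split(",")
            return [tag.strip() for tag in tags if tag.strip()]
    return []


def primary_subtag(tag):
    """Return the lowercase primary language subtag, e.g. 'zh' for 'zh-TW'."""
    return tag.split("-")[0].lower()


def tag_matches(header, detected):
    """True if any declared tag is compatible with the detected ISO 639-3 code."""
    for tag in declared_langs(header):
        if detected in ISO_639_3.get(primary_subtag(tag), set()):
            return True
    return False


if __name__ == "__main__":
    print(tag_matches("20 text/gemini; lang=zh-TW", "eng"))
```

A header with no lang parameter yields an empty list, so tag_matches returns False for it; a real checker would probably want to treat "nothing declared" separately from "declared but wrong".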

