Bezoku
@bezoku.bsky.social
290 followers 1.9K following 150 posts
Language empowerment technology Bezoku.ai #NLProc #PyTorch #culture #langsky #universal #linguistics #language #tokenizer #semantic #python #syntax #homomorphism #discourse
Posts Media Videos Starter Packs
Reposted by Bezoku
Inspired to share some papers that I found at #COLM2025!

"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
Title, authors, and abstract of the paper. Figure 3: Change of accuracy from first to final checkpoint on individual benchmarks shown as a range, with grey indicating the first checkpoint and colours indicating the last checkpoint. The random-guess threshold is shown as a grey vertical line in cases where at least one model falls below it. Bars and legend shown in order of average accuracy.
#windows10 support ceased today.

DM for an info sheet on how to convert a windows 10 machine to run on a secure, localized Linux desktop with a Bezoku local language model.

#LowResourceLanguages #NLProc #eWaste
Reposted by Bezoku
It’s me or a lot of Wikipedia French articles seem to have been automatically translated lately and without saying so?

Like this one for exemple:

fr.wikipedia.org/wiki/George_...
George Santos — Wikipédia
fr.wikipedia.org
We need a bench of skilled linguists who have worked on corpora and treebanks applying their skills for syntactic annotation.

Attached is a list of supported languages for the first release of the bezoku platform.

DM to learn more
We updated the list of papers we are reading after #syntaxfest - focused around low resource and indigenous language technology developments.

What did we miss 🤷‍♂️

#NLProc #Linguistics #NLP

github.com/bezokurepo/p...
github.com
“Why did the French chef kill himself?

Because he lost the huile d’olive”

The role of humour in language is so important, that’s why we include jokes in every corpus.

#linguistics #NLProc

www.reddit.com/r/French/com...
From the French community on Reddit
Explore this post and more from the French community
www.reddit.com
Reposted by Bezoku
🚨Postdoctoral fellowship in corpus phonetics / data science for speech with me and Ann Bradlow. Position is open immediately. Apply now! 🚨 Details: faculty.wcas.northwestern.edu/matt-goldric...
I will try to mention at dinner this evening 🙏
Apologies, made the wrong inference 🙄. It is a wonderful event
Amazing. Bezoku is attending and not an organizer, we don’t pretend to take any credit . It is an amazing community and opportunity for learning
The organizers did a great job for all 5 events
Not sure. There are ~120 people at the event so maybe someone is putting something out
Welcome to Slovenia.

#syntaxfest is in full swing and the agenda is off the charts.

#syntax #semantics #morphosynyactic #linguistics #language
The team are heading to Slovenia on Monday to attend the biannual #syntaxfest

DM to meet and discuss #LSTM architectures and the application of the CoNLL-U standard to #lowresourcelanguages and #indigenouslanguage

#nolanguageleftbehind

syntaxfest.github.io/syntaxfest25/
SyntaxFest 2025 | Ljubljana, Slovenia
syntaxfest.github.io
Reposted by Bezoku
Open source has been embraced by hyperscalers to drive innovation and level the playing field.

Open source wasn’t embraced to give up advantage—but to gain it.
After building our network in Florida, we have decided to re-locate to a country that embraces diversity, equity and inclusion.

We have enjoyed every minute in Miami, but State and Federal government is sliding towards fascism.

#DEI #Linguistics #NLProc

www.latintimes.com/ignorant-mag...
'Ignorant' MAGA Influencer Praises Trump Admin for Florida Deportations: 'My Uber Drivers Finally Speak English'
A MAGA supporting social media user drew ire from social media users after thanking White House Press Secretary Karoline Leavitt for bringing back Uber drivers who speak english.
www.latintimes.com
❤️ Linux

The future is largely built on top of Linux, and that goes for Bezoku too

#linux #NLProc #opensource
Reduced costs.
Increased competition.
Open source enables AI to be built faster with focus on high-value applications.

#OpenSource #AI #CloudNative #LLMs
We endorse this approach to re-purpose computers from Windows.

We have a release in a couple of months to help add options for low resource languages

#windows10 #ewaste #NLProc
Hundreds of thousands of Computers won't be able to upgrade to Windows 11, but that shouldn't make them eWaste. Kudos to the kde team for this amazing initiative! This website provides options and information for installing KDE desktop and reuse older computers safely

endof10.org

#linux #kde
End of 10
endof10.org