🌎 explosion.ai
💼 linkedin.com/company/explosion-ai
🐘 sigmoid.social/@explosion
📺 youtube.com/c/ExplosionAI
explosion.ai/blog/gitlab-...
explosion.ai/blog/gitlab-...
10× speed-up of their data workflows and up to 99% accuracy at 6mb!
explosion.ai/blog/sp-glob...
10× speed-up of their data workflows and up to 99% accuracy at 6mb!
explosion.ai/blog/sp-glob...
This release is the foundation of the upcoming spaCy v4 release and adds support for more powerful learning rates.
We have also merged thinc-apple-ops into Thinc, so Apple AMX is supported out-of-the-box.
Details & release notes: github.com/explosion/th...
This release is the foundation of the upcoming spaCy v4 release and adds support for more powerful learning rates.
We have also merged thinc-apple-ops into Thinc, so Apple AMX is supported out-of-the-box.
Details & release notes: github.com/explosion/th...
To maximize ROI from your data engineering, evaluation metrics should be paired with quantitative error analysis. Our latest example error analysis recipe iterates through false positives/negatives and lets you record the reasons to inform your improvement plan.
To maximize ROI from your data engineering, evaluation metrics should be paired with quantitative error analysis. Our latest example error analysis recipe iterates through false positives/negatives and lets you record the reasons to inform your improvement plan.
During the training process, we recommend running Prodigy's train-curve command, which is a great way to quickly see whether more data of similar quality as the current dataset would improve the model.
During the training process, we recommend running Prodigy's train-curve command, which is a great way to quickly see whether more data of similar quality as the current dataset would improve the model.
Quantitative measurements of disagreements should always be accompanied by a qualitative analysis. Prodigy's review recipe is an excellent tool for that.
We use it in all our consulting projects to inform and illustrate data model discussions: explosion.ai/tailored-sol...
Quantitative measurements of disagreements should always be accompanied by a qualitative analysis. Prodigy's review recipe is an excellent tool for that.
We use it in all our consulting projects to inform and illustrate data model discussions: explosion.ai/tailored-sol...
Data development is an iterative process. It’s good practice to test your initial annotation scheme and guidelines during the pilot phase and measure the inter-annotator agreement.
Data development is an iterative process. It’s good practice to test your initial annotation scheme and guidelines during the pilot phase and measure the inter-annotator agreement.
📈 confusion matrix and per-label stats
🔎 explore examples your model struggles with most
🍬 entity-level insights for NER with MantisNLP's nervaluate library
github.com/explosion/pr...
📈 confusion matrix and per-label stats
🔎 explore examples your model struggles with most
🍬 entity-level insights for NER with MantisNLP's nervaluate library
github.com/explosion/pr...
🔒 Introducing the Prodigy Single Sign-On (SSO) plugin
It's the first in a series of premium Prodigy plugins for company licenses.
🔒 Introducing the Prodigy Single Sign-On (SSO) plugin
It's the first in a series of premium Prodigy plugins for company licenses.
Great work from the Nesta team & thanks to ESCoE for funding!
Great work from the Nesta team & thanks to ESCoE for funding!
A few project highlights in this thread 🧵✨
explosion.ai/blog/nesta-s...
A few project highlights in this thread 🧵✨
explosion.ai/blog/nesta-s...
spacy.io/api/large-la...
spacy.io/api/large-la...
🔗 Built-in entity linking support
💬 New task for translation from/to arbitrary languages
❓ Use the Doc as prompt for question answering
🧩 Arbitrarily long docs via sharding
github.com/explosion/sp...
🔗 Built-in entity linking support
💬 New task for translation from/to arbitrary languages
❓ Use the Doc as prompt for question answering
🧩 Arbitrarily long docs via sharding
github.com/explosion/sp...
Read & sign up: us12.campaign-archive.com?u=83b0498b1e...
Read & sign up: us12.campaign-archive.com?u=83b0498b1e...
To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature 👀
youtu.be/vhbyekSsG8o
To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature 👀
youtu.be/vhbyekSsG8o
Incorporate frameworks like HTMX for a dynamic interface using our latest Custom Events.
prodi.gy/docs/custom-....
Incorporate frameworks like HTMX for a dynamic interface using our latest Custom Events.
prodi.gy/docs/custom-....
prodi.gy/docs/changelog
☑️ A new toggle between token and character-based highlighting to NER and span UI: speedy token-based annotations and precise character highlighting! 🚀
prodi.gy/docs/changelog
☑️ A new toggle between token and character-based highlighting to NER and span UI: speedy token-based annotations and precise character highlighting! 🚀
You can see all the details here:
prodi.gy/docs/plugins
You can see all the details here:
prodi.gy/docs/plugins
It's a new plugin that allows you to train @huggingface.bsky.social NER models directly on annotated data in Prodigy. It also provides a recipe to upload annotations to Hugging Face HUB!
It's a new plugin that allows you to train @huggingface.bsky.social NER models directly on annotated data in Prodigy. It also provides a recipe to upload annotations to Hugging Face HUB!
To help explain how to use PDF segmentation and OCR @koaning.bsky.social made a small demo video to highlight the new feature 👀 www.youtube.com/watch?v=rwyz...
To help explain how to use PDF segmentation and OCR @koaning.bsky.social made a small demo video to highlight the new feature 👀 www.youtube.com/watch?v=rwyz...
To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature 👀
youtu.be/jyu2nbjwfXw
To help explain this new feature, @koaning.bsky.social made a small demo to highlight the new feature 👀
youtu.be/jyu2nbjwfXw