Topic: Computer Vision · Explosion · Developer tools and consulting for AI, Machine Learning and NLP

Conquering PDFs: document understanding beyond plain text · PyData London

In this talk, Ines presents a new and modular approach for building robust document understanding systems, using state-of-the-art models and the awesome Python ecosystem.

📚 spacy-layout v0.0.12 · Mar 8, 2025

Support processing PDFs with context, add document index tables and more docs

📚 spacy-layout v0.0.1 · Nov 18, 2024

Process PDFs, Word documents and more with spaCy

Describing Images Fast and Slow: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes · Takmaz, Pezzelle, Fernández (2024)

We use the spaCy library for tokenization, part-of-speech tagging, and lemmatization of the words in the descriptions.

Prodigy-PDF for PDF annotation and OCR

Want to annotate PDF files? Our new Prodigy plugin can help with that! To explain how to use PDF segmentation and OCR, Vincent made a small demo video.

Image Captioning with Prodigy & PyTorch

In this video, we’ll show you how you can use Prodigy to script fully custom annotation workflows in Python, how to plug in your own machine learning models and how to mix and match different interfaces for your specific use case.

Conquering PDFs: document understanding beyond plain text · PyCon DE & PyData

In this talk, Ines presents a new and modular approach for building robust document understanding systems, using state-of-the-art models and the awesome Python ecosystem.

📚 spacy-layout v0.0.6 · Nov 24, 2024

Add support for tables and convert tabular data to pandas.DataFrame

Microsoft Presidio v2.2.352

Context aware, pluggable and customizable PII de-identification and anonymization service for text and images, featuring a spaCy back-end.

🔌 prodigy-segment v0.1.0 · Dec 13, 2023

Select pixels in Prodigy via Meta’s “Segment Anything” model

Finetuning and Bulk Labelling Images with Prodigy

In this video, we’ll show how you might be able to improve the annotation experience by using bulk labelling for image classification.

Best Way to OCR a PDF in Python · Python Tutorials for Digital Humanities

Tutorial by WJB Mattingly on how to use the new spaCy Layout package and Docling to convert PDFs to text.

🔌 prodigy-pdf v0.4.0 · Nov 25, 2024

Add text-based span annotation for PDFs

Prodigy-Segment for Pixel Segmentation

Use Meta’s “Segment Anything” model in Prodigy to help you select the right pixels in images.

🔌 prodigy-lunr v0.1.0 · Oct 5, 2023

Document search via LUNR to fetch relevant data subsets to label

Finding Bad Image Data using UMAP and Prodigy

In this video, we’ll show you how to use Prodigy to find bad examples in the Google QuickDraw dataset. We will be leveraging a technique that involves UMAP to find strange images semi-automatically.

From PDFs to AI-ready structured data: a deep dive

This blog post presents a new modular workflow for converting PDFs and similar documents to structured data and shows you how to build end-to-end document understanding and information extraction pipelines for industry use cases.

🔌 prodigy-pdf v0.3.0 · Nov 18, 2024

Support multi-page PDFs in a single view

Prodigy-ANN for Image Retrieval via CLIP

Dealing with a huge bucket of images that you want to annotate? The new image retrieval features in Prodigy-ANN (approximate nearest neighbors) might help!

🔌 prodigy-pdf v0.1.0 · Oct 5, 2023

Annotate and segment PDF files and perform OCR

Prodigy v1.10: Dependencies, relations, audio, video & more

Version 1.10 of Prodigy brings tons of new features, including manual dependency and relation annotation, audio and video annotation, a new and improved image UI, new recipe callbacks, more settings for manual NER, and various new config options.