Page: 5 · Explosion · Developer tools and consulting for AI, Machine Learning and NLP

Explosion builds developer tools for AI, Machine Learning and Natural Language Processing. →
Consulting

Project

Topics

Category

Tasks

Authors

Filtered by page: 5

✨ prodigy v1.11.0Aug 12, 2020

spaCy v3 support, annotation for overlapping and nested spans, better installation & more

Welcome spaCy to the Hugging Face Hub

Welcome spaCy to the Hugging Face Hub Hugging Face Blog

Hugging Face makes it really easy to share your spaCy pipelines with the community! With a single command, you can upload any pipeline package, with a pretty model card and all required metadata auto-generated for you.

Applied NLP Thinking: How to Translate Problems into Solutions

Applied NLP Thinking: How to Translate Problems into Solutions

We’ve been running Explosion for about five years now, which has given us a lot of insights into what Natural Language Processing looks like in industry contexts. In this blog post, I’m going to discuss some of the biggest challenges for applied NLP and translating business problems into machine learning solutions.

Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence

Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence Halterman, Keith, Sarwar, O’Connor (2021), ACL 2021

Figure A2 shows a stylized version of the custom interface we built using the Prodigy annotation tool. Annotators are presented with an entire document, with sentences sequentially highlighted.

How We Found Pricey Provisions in New Jersey Police Contracts

How We Found Pricey Provisions in New Jersey Police Contracts ProPublica

ProPublica and the Asbury Park Press scoured hundreds of police union agreements for details on publicly funded payouts to cops, using spaCy under the hood.

Introducing spaCy v3.0

Introducing spaCy v3.0

spaCy v3.0 is a huge release! It features new transformer-based pipelines that get spaCy's accuracy right up to the current state-of-the-art, and a new workflow system to help you take projects from prototype to production. It's much easier to configure and train your pipeline, and there are lots of new and improved integrations with the rest of the NLP ecosystem.

🔮 thinc v8.0.0Jan 24, 2021

Type checking, wrap PyToch, TensorFlow & MXNet, integrated config system

Building Industrial-Strength NLP Pipelines

Building Industrial-Strength NLP Pipelines Weights & Biases

How We Analyzed Google’s Search Results

How We Analyzed Google’s Search Results The Markup

Using the Prodigy annotation tool, we created a user interface and a coder manual for two annotators to spot-check 741 stained images randomly sampled from our dataset.

Introducing spaCy v2.3

Introducing spaCy v2.3

spaCy now speaks Chinese, Japanese, Danish, Polish and Romanian! Version 2.3 of the spaCy Natural Language Processing library adds models for five new languages. We've also updated all 15 model families with word vectors and improved accuracy, while also decreasing model size and loading times for models with vectors.

🛸 spacy-transformers v0.6.0May 24, 2020

Update to transformers v2.5.0

Intro to NLP with spaCy (5): Detecting programming languages

Intro to NLP with spaCy (5): Detecting programming languages

The Future of NLP in Python

The Future of NLP in Python PyCon Colombia Keynote

The data community came to Python for the language, and stayed for each other – once it got critical mass, it’s the ecosystem that counts. We’ve been proud to be part of that. So what does the future hold for NLP in Python?

spaCy meets Transformers

spaCy meets Transformers Hacking Machine Learning

Let Them Write Code

Let Them Write Code PyCon India Keynote

Entity linking for spaCy: Grounding textual mentions

Entity linking for spaCy: Grounding textual mentions Belgium NLP Meetup

Introduction to Japanese Natural Language Processing

Introduction to Japanese Natural Language Processing Masato Hagiwara, Paul O’Leary McCann (2021)

A thorough guide for programmers working with Japanese text, covering fundamental issues like tokenization and recent research topics like generating natural language texts.

🤗 spacy-huggingface-hub v0.0.1Jul 6, 2021

Upload spaCy pipelines to the Hugging Face Hub

Interview: Ines Montani & Sebastián Ramírez

Interview: Ines Montani & Sebastián Ramírez PyFest

Building Industrial-Strength NLP Applications

Building Industrial-Strength NLP Applications Snorkel Science Talks

spaCy v3: Custom trainable relation extraction component

spaCy v3: Custom trainable relation extraction component

spaCy v3.0 features new transformer-based pipelines that get spaCy’s accuracy right up to the current state-of-the-art, and a new training config and workflow system to help you take projects from prototype to production. In this video, Sofie shows you how to apply all these new features when implementing a custom trainable component from scratch.

Building a Data Science Startup

Building a Data Science Startup TalkPython Podcast

Ines becomes a Python Software Foundation Fellow

The Physical Traits that Define Men and Women in Literature

The Physical Traits that Define Men and Women in Literature The Pudding

Analysis of physical traits most tied to gender in literature using spaCy.

👑 spacy-streamlit v0.0.2Jun 23, 2020

spaCy building blocks and visualizers for Streamlit apps

Building customizable NLP pipelines with spaCy

Building customizable NLP pipelines with spaCy Turku.AI Meetup

Training a custom entity linking model with spaCy

Training a custom entity linking model with spaCy

In this video, we show you how to create a custom Entity Linking model in spaCy to disambiguate different mentions of the person “Emerson” to unique identifiers in a knowledge base.

Explosion in 2019: Our Year in Review

Explosion in 2019: Our Year in Review

As 2019 draws to a close and we step into the 2020s, we thought we’d take a look back at the year and all we’ve accomplished. And we realized we had so much that we could give you a month-by-month rundown of everything that happened.

✨ prodigy v1.9.0Dec 18, 2019

Custom UI blocks, text input UI, better training and data conversion

sense2vec reloaded: contextually-keyed word vectors

sense2vec reloaded: contextually-keyed word vectors

In 2016 we trained a sense2vec model on the 2015 portion of the Reddit comments corpus, leading to a useful library and one of our most popular demos. That work is now due for an update. In this post, we present a new version and a demo NER project that we trained to usable accuracy in just a few hours.

spaCy and the future of multi-lingual NLP

spaCy and the future of multi-lingual NLP META Forum

Intro to NLP with spaCy (2): Detecting programming languages

Intro to NLP with spaCy (2): Detecting programming languages

Mastering spaCy

Mastering spaCy Duygu Altinok (Packt Publishing, 2021)

An end-to-end practical guide to implementing NLP applications using the Python ecosystem. By the end of this book, you'll be able to confidently use spaCy, including its linguistic features, word vectors, and classifiers, to create your own NLP apps.

What does “real-world NLP” look like and how can students get ready for it?

What does “real-world NLP” look like and how can students get ready for it?Teaching NLP at NAACL Keynote

A Bit of AI Episode 7

A Bit of AI Episode 7 A Bit of AI

🦆 sense2vec v2.0.0Feb 7, 2021

Update component for spaCy v3

spaCy v3: Design concepts explained (behind the scenes)

spaCy v3: Design concepts explained (behind the scenes)

In this video, Ines shows you some of the new design concepts and explain what’s going on under the hood, how we’ve implemented them and most importantly, why.

Explosion in 2020: Our Year in Review

Explosion in 2020: Our Year in Review

While 2020 hasn’t been easy for anyone, at Explosion we’ve considered ourselves relatively fortunate in this most interesting year. We’ve always worked remotely, so we’ve been able to take both pride and comfort in continuing to ship good software. Here’s a look back at what we’ve been up to.

spaCy v3.0: Bringing State-of-the-art NLP from Prototype to Production

spaCy v3.0: Bringing State-of-the-art NLP from Prototype to Production Global AI Live Keynote

🦉 srsly v2.1.0Jun 22, 2020

Support YAML

Designing Practical NLP Solutions

Designing Practical NLP Solutions L3-AI

Image Captioning with Prodigy & PyTorch

Image Captioning with Prodigy & PyTorch

In this video, we’ll show you how you can use Prodigy to script fully custom annotation workflows in Python, how to plug in your own machine learning models and how to mix and match different interfaces for your specific use case.

Intro to NLP with spaCy (4): Detecting programming languages

Intro to NLP with spaCy (4): Detecting programming languages

Intro to NLP with spaCy (3): Detecting programming languages

Intro to NLP with spaCy (3): Detecting programming languages

🦆 sense2vec v1.0.0Nov 22, 2019

More features, 2019 Reddit vectors model and Prodigy recipes

Interview with Ines Montani Sayak Paul

Ines talks about how she got into programming, how to stay up to date with the latest developments in our field and the ideas behind the PyCon India keynote “Let Them Write Code”.

Explosion awarded META Seal of Recognition

Explosion awarded META Seal of Recognition

We’re proud to accept the META Seal of Recognition at META-FORUM in Brussels, along with Mozilla. The META-FORUM is an international conference series backed by the European Union on powerful and innovative Language Technologies for a multilingual information society.

Millennials Kill Everything

Millennials Kill Everything The Pudding

Analysis on media reporting of millenials using spaCy. From napkins to marriage to Applebees, just looking at headlines you’d guess that for the past decade the millennial generation’s been on a rampage.

Introducing spaCy v3.1

Introducing spaCy v3.1

It’s been great to see the adoption of spaCy v3, which introduced transformer-based pipelines, a new training system and more. Version 3.1 adds more on top of it, including the ability to use predicted annotations during training, a component for predicting arbitrary and overlapping spans and new pipelines for Catalan and Danish.

spaCy v3: State-of-the-art NLP from Prototype to Production

spaCy v3: State-of-the-art NLP from Prototype to Production Bay Area NLP

Intro to NLP with spaCy (6): Detecting programming languages

Intro to NLP with spaCy (6): Detecting programming languages

🛸 spacy-transformers v1.0.0Feb 1, 2021

Update components for spaCy v3.0

spaCy v3: State-of-the-art NLP from Prototype to Production

spaCy v3: State-of-the-art NLP from Prototype to Production

How to build resilient NLP applications

How to build resilient NLP applications Rasa Chats

Ines Montani brought linguistic and computers together DevJourney

Prodigy v1.10: Dependencies, relations, audio, video & more

Prodigy v1.10: Dependencies, relations, audio, video & more

Version 1.10 of Prodigy includes tons of new features, including manual dependency and relation annotation, audio and video annotation, a new and improved image UI, new recipe callbacks, more settings for manual NER, plus various new config options and settings.

✨ prodigy v1.10.0Jun 16, 2020

Dependency and relation annotation, audio, video, character-based NER & more

Training a Named Entity Recognition Model with Prodigy and Transfer Learning

Training a Named Entity Recognition Model with Prodigy and Transfer Learning

In this video, we’ll show you how to use Prodigy to train a named entity recognition model from scratch, by taking advantage of semi-automatic annotation and modern transfer learning techniques.

PyCon Colombia Speaker Interview

PyCon Colombia Speaker Interview

Karo and Ines talked about getting into tech and machine learning, and what’s next for spaCy and our other tools.

Künstliche Intelligenz Beyond the Hype

Künstliche Intelligenz Beyond the Hype Zündfunk Netzkongress (German)

“Artificial intelligence” is everywhere in the headlines. Many futuristic-sounding things suddenly seem possible. It’s not easy to judge what all these technological advances mean. What is hype and what really works? And how should we imagine the future?

Using spaCy with Hugging Face Transformers

Using spaCy with Hugging Face Transformers PyCon India

Transformer models like BERT have set a new standard for accuracy on almost every NLP leaderboard. However, these models are very new, and most of the software ecosystem surrounding them is oriented towards the many opportunities for further research. In this talk, Matt describes how you can now use these models in spaCy to work on real problems and the many opportunities transfer learningfor production NLP, regardless of which software packages you choose.

Introducing spaCy v2.2

Introducing spaCy v2.2

Version 2.2 of the spaCy Natural Language Processing library is leaner, cleaner and even more user-friendly. In addition to new model packages and features for training, evaluation and serialization, we've made lots of bug fixes, improved debugging and error handling, and greatly reduced the size of the library on disk.

Intro to NLP with spaCy (1): Detecting programming languages

Intro to NLP with spaCy (1): Detecting programming languages

In this new video series, data science instructor Vincent Warmerdam gets started with spaCy, an open-source library for Natural Language Processing in Python. His mission: building a system to automatically detect programming languages in large volumes of text.