NLP practitioners: this episode is for you. From the awareness of linguistic elements and annotation to getting the necessary people in the room, Vincent Warmerdam presents to Jon Krohn a recipe for a successful project and the open-source NLP tools to get there.
This episode is brought to you by epic LinkedIn Learning instructor Keith McCormick (
https://linkedin.com/learning/instructors/keith-mccormick). Interested in sponsoring a SuperDataScience Podcast episode? Visit
JonKrohn.com/podcast for sponsorship information.
In this episode you will learn:
• How Vincent came to work with De Speld [08:57]
• Vincent’s role at Explosion [18:59]
• How users can apply spaCy [21:46]
• Prodigy: Annotate training data more efficiently with scripts [26:28]
• How to manage “skill anxiety” with Calmcode [32:32]
• How Vincent fixed bad labels [42:47]
• The value of understanding linguistics for NLP [54:42]
• How to constrain artificial stupidity [1:02:38]
Additional materials:
www.superdatascience.com/659