For Knowledge For Experts For Teams Blog
Sign Up
menu
Natural Language Processing (NLP) - Tokenization & Regular Expressions
course-icon
clocklogo
Duration
1.25 hours session
Course outline
Design
Tokenization & Regular Expressions:-
- Overview of Tokenization
- Linguistic theory for Word Segmentation
- Tokenization with NLTK
- Cliticisation & Contractions in Tokenization
- Contractions Library
- Overview of Regular Expressions
- Word Segmentation
- Sentence Segmentation
- ReGex Split & Subsitute Method
- Search Method
Prerequisite knowledge
Python Programming Knowledge
Basic Natural Language Processing (NLP) Knowledge
Learning outcomes
This micro-learning session will take the learners through NLP techniques, Tokenization and Regular Expression, for parsing text.
Software / Hardware requirements
Anaconda Distribution
Install the ‘64-bit Graphical installer’
Python
(For both Windows & macOS)
Sugggested link:
https://www.anaconda.com/products/individual
course-icon
Natural Language Processing (NLP) - Stemming & Lemmatization
1.25 hours session
course-icon
Natural Language Processing (NLP) - Text Summarization
1.25 hours session
course-icon
Deep Learning - Natural Language Processing (NLP)
3 hours session
Want to upskill or acquire a new skill?
Start Learning!