Tokenization & Regular Expressions:-
- Overview of Tokenization
- Linguistic theory for Word Segmentation
- Tokenization with NLTK
- Cliticisation & Contractions in Tokenization
- Contractions Library
- Overview of Regular Expressions
- Word Segmentation
- Sentence Segmentation
- ReGex Split & Subsitute Method
- Search Method
This micro-learning session will take the learners through NLP techniques, Tokenization and Regular Expression, for parsing text.