
Navigate the NLP with Python curriculum from text basics to deep learning, covering spaCy, NLTK, tokenization, stemming, lemmatization, stop words, part of speech tagging, and text classification.
Explore python text basics by reading text and PDFs with built-in libraries and no external dependencies. Learn basic regular expressions to find patterns in text and complete an assessment.
Learn to read and extract text from PDFs using PyPDF2, and handle limitations with scanned or non-extractable PDFs. Create and append pages to new PDFs with Python.
Explore the NLP assessment in Python text basics, including f-string printing, file creation and reading, PDF text extraction with PyPDF2, appending to Contacts.txt, and regex email extraction.
We set up spaCy and the required language library, explore tokenization, stemming, lemmatization, and stop words, and compare NLTK and spaCy while outlining what natural language processing is.
Explore spaCy, the open source python NLP library designed for speed, and compare it to NLTK. Learn to install spaCy, download the English language model, and load it for use.
Explore what natural language processing is and how computers turn raw English text into structured data using spaCy and NLTK, with examples like spam detection and sentiment analysis.
Explore stemming in natural language processing using Porter's and Snowball stemmers in Python with NLTK, and compare their effects on words like run, runs, and fairly.
Learn to filter stop words with spaCy, view the default 305 English stop words, and add or remove custom stop words like btw to tailor NLP preprocessing.
Explore phrase matching and vocabulary techniques in Python NLP, using a phrase matcher on a Reaganomics text to extract terms like supply-side and trickle-down economics with context.
Visualizes parts of speech with displaCy in spaCy, showing POS tags and syntactic dependencies via render or serve, with options for distance, compact, color, background, and fonts.
Learn to add multiple named entities with a spacey phrase matcher, creating product entities like vacuum space cleaner and vacuum dash cleaner, and count entities by label such as money.
Demonstrate a spaCy driven parts-of-speech assessment workflow: load English, build a doc from Peter Rabbit, analyze tokens, POS tags, dependency parse, noun frequency, and named entities.
Explore how classification metrics evaluate NLP models, focusing on accuracy, recall, precision, and F1 score through a binary spam vs ham example, using vectorization and a confusion matrix.
Explore manual vocabulary construction and feature extraction from text, then vectorize across documents with scikit-learn using count vectorizer and tf-idf, and build pipelines for text classification.
Complete a text classification assessment by loading moviereviews2.tsv into a pandas data frame, building a vectorization pipeline, training a model, and evaluating with the confusion matrix, classification report, and accuracy.
Explore semantics and sentiment analysis using spaCy and Python. Learn the general syntax for sentiment analysis and apply Vader with NLTK for text classification.
Explore spaCy word vectors and document vectors in Python, learn vector similarities and king minus man plus woman as a practical semantic demonstration.
Learn to perform sentiment analysis with Vader in Python using NLTK. Create a sentiment intensity analyzer, interpret negative, neutral, positive, and compound scores, and apply it to Amazon reviews.
Explore the topic modeling assessment project with over 400,000 Quora questions, applying tf-idf vectorization and non-negative matrix factorization to extract 20 topics. Use random state 42 to align topic results.
Explore the basics of deep learning for natural language processing, study RNNs and LSTMs to generate text from a source corpus like Moby-Dick, and build Q&A chatbots in Python.
Connect multiple perceptrons into a multi-layer neural network with input, hidden, and output layers, then compare sigmoid, tanh, and ReLU activations, using z = w x + b and Keras.
Explore how LSTM and GRU cells manage long-term memory for text generation, detailing forget and input gates, cell state updates, and peepholes, using Keras for practical RNN data preparation.
Learn to vectorize the dataset with Python by tokenizing text, building word indices, and padding sequences for stories, questions, and answers, using a reusable vectorize_stories function.
Create and train a Python chat bot, load the chatbot_10.h5 file after 100 epochs, evaluate on the test set, plot training history, and craft custom stories and questions.
Welcome to the best Natural Language Processing course on the internet! This course is designed to be your complete online resource for learning how to use Natural Language Processing with the Python programming language.
In the course we will cover everything you need to learn in order to become a world class practitioner of NLP with Python.
We'll start off with the basics, learning how to open and work with text and PDF files with Python, as well as learning how to use regular expressions to search for custom patterns inside of text files.
Afterwards we will begin with the basics of Natural Language Processing, utilizing the Natural Language Toolkit library for Python, as well as the state of the art Spacy library for ultra fast tokenization, parsing, entity recognition, and lemmatization of text.
We'll understand fundamental NLP concepts such as stemming, lemmatization, stop words, phrase matching, tokenization and more!
Next we will cover Part-of-Speech tagging, where your Python scripts will be able to automatically assign words in text to their appropriate part of speech, such as nouns, verbs and adjectives, an essential part of building intelligent language systems.
We'll also learn about named entity recognition, allowing your code to automatically understand concepts like money, time, companies, products, and more simply by supplying the text information.
Through state of the art visualization libraries we will be able view these relationships in real time.
Then we will move on to understanding machine learning with Scikit-Learn to conduct text classification, such as automatically building machine learning systems that can determine positive versus negative movie reviews, or spam versus legitimate email messages.
We will expand this knowledge to more complex unsupervised learning methods for natural language processing, such as topic modelling, where our machine learning models will detect topics and major concepts from raw text files.
This course even covers advanced topics, such as sentiment analysis of text with the NLTK library, and creating semantic word vectors with the Word2Vec algorithm.
Included in this course is an entire section devoted to state of the art advanced topics, such as using deep learning to build out our own chat bots!
Not only do you get fantastic technical content with this course, but you will also get access to both our course related Question and Answer forums, as well as our live student chat channel, so you can team up with other students for projects, or get help on the course content from myself and the course teaching assistants.
All of this comes with a 30 day money back garuantee, so you can try the course risk free.
What are you waiting for? Become an expert in natural language processing today!
I will see you inside the course,
Jose