
Tag Archives: Python-nltk

Natural Language Toolkit (NLTK) is a platform used for building programs for text analysis. We can observe that male and female names have some distinctive… Read More
WordNet is a lexical database, i.e. a dictionary, for the English language, designed specifically for natural language processing. A Synset is a special kind of a simple… Read More
Why do we need to train a sentence tokenizer? In NLTK, the default sentence tokenizer is general-purpose and works very well. But… Read More
A single token is referred to as a unigram, for example – hello; movie; coding. This article focuses on the unigram tagger. Unigram Tagger: For… Read More
Chunk extraction, or partial parsing, is the process of extracting short, meaningful phrases from a sentence (tagged with Part-of-Speech). Chunks are made up of words and… Read More
How does Wu & Palmer Similarity work? It calculates relatedness by considering the depths of the two synsets in the WordNet taxonomies, along with the depth… Read More
Collocations are two or more words that tend to appear frequently together, for example – United States. There are many other words that can come… Read More
Below are the steps involved for Chunking – Conversion of sentence to a flat tree. Creation of Chunk string using this tree. Creation of RegexpChunkParser… Read More
SplitRule class: It splits a chunk based on the specified split pattern. It is specified like <NN.*>}{<.*> i.e. two opposing curly… Read More
Natural Language Processing (NLP) is a subfield of computer science, artificial intelligence, information engineering, and human-computer interaction. This field focuses on how to program computers… Read More
What is Part-of-speech (POS) tagging? It is a process of converting a sentence to forms – list of words, list of tuples (where each… Read More
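
As a rough illustration of the unigram tagging idea mentioned in the list above, the sketch below tags each word with the tag it carried most often in some training data, and returns the result as a list of (word, tag) tuples. It is plain Python rather than NLTK's own tagger classes, and the training sentences, the function names `train_unigram_tagger` and `tag`, and the tag labels are all hypothetical, chosen only for this example.

```python
from collections import Counter, defaultdict


def train_unigram_tagger(tagged_sentences):
    """Map each word to the tag it carries most often in the training data."""
    counts = defaultdict(Counter)
    for sentence in tagged_sentences:
        for word, tag_label in sentence:
            counts[word][tag_label] += 1
    # Keep only the single most frequent tag per word.
    return {word: tag_counts.most_common(1)[0][0]
            for word, tag_counts in counts.items()}


def tag(words, model, default=None):
    """Tag a list of words; words never seen in training get `default`."""
    return [(word, model.get(word, default)) for word in words]


# Hypothetical toy training data: lists of (word, tag) tuples.
training = [
    [("the", "DT"), ("movie", "NN"), ("ends", "VBZ")],
    [("the", "DT"), ("coding", "NN"), ("helps", "VBZ")],
    [("coding", "VBG"), ("the", "DT"), ("movie", "NN")],
    [("coding", "NN"), ("ends", "VBZ")],
]

model = train_unigram_tagger(training)
tagged = tag(["the", "coding", "movie", "hello"], model)
print(tagged)
```

Because the lookup uses only the word itself, "coding" always receives its most frequent training tag regardless of context, and the unseen word "hello" falls back to the default.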
