Word Frequency

Word Frequency

Word Frequency 

Description

Word frequency is the number of occurrences of a word in a given text. 

Why to use

Textual Analysis – Pre Processing 

When to use

When you want to find the frequency of a word, that is, the number of times a particular word has appeared in a given text. 

When not to use

On numerical data.

Prerequisites

It should be textual data.  

Input

It was the best of times it was the worst of times.
It was the age of wisdom it was the age of foolishness.


Output

It – 4
Was – 4
The – 4
Best – 1
Of – 4
Times - 2
Worst - 1
Age – 2
Wisdom - 1
Foolishness – 1

Related algorithms

  • Case Convertor
  • Custom Words Remover
  • Frequent Words Remover
  • Lemmatizer
  • Punctuation Remover
  • Spelling Corrector
  • Stemmer
  • Advanced Entity Extraction
  • Word Correlation

Alternative algorithm


Statistical Methods used



Limitations

It cannot be used on Numerical data.


Word frequency is the number of occurrences of a word in the textual data.
    • Related Articles

    • Word Frequency

      Word Frequency Description Word frequency is the number of occurrences of a word in a given text. Why to use Textual Analysis – Pre Processing When to use When you want to find the frequency of a word, that is, the number of times a particular word ...
    • Word Correlation

      Word Correlation Description Word correlation refers to the association or relationship between two words in a text. It determines whether and how strongly pairs of quantitative and continuous variables (in this case, words) are related to each ...
    • Word Embedding

      Word Embedding Description Word Embedding is a form of word representation that bridges the human understanding of language to that of a machine. These are found to be useful representations of words and lead to better performance in the various ...
    • Word Embedding

      Word Embedding Description Word Embedding is a form of word representation that bridges the human understanding of language to that of a machine. These are found to be useful representations of words and lead to better performance in the various ...
    • Stemmer

      Stemmer Description The automated process produces a base string in an attempt to represent related words. For example, if the words are "runs", "running", and "runner", then the algorithm will automatically reduce these words to the root word "run". ...