Pepe's Braindump

Bag of words

tags
NLP

Bag of words is a technique to transform sentences to vectors in which a BOW is created first, containing the amount of vocabulary we want, ordered. Then we represent each sentece with a vector of bools which trues represent that that word was present in the sentence, false that it wasn’t.

One of the disadvantages of this technique is that order of words in the original sentence is lost.

Cortex theme by Jethro Kuan. Built with org-mode, org-roam and Hugo