-
Local Twitter Search engine including:
-
Custom build Parser to collect data from Emojis Numbers, Countries, Entities and more.
-
Smart inverted index for creating posting files.
-
GloVe: Global Vectors for Word Representation.
-
W2V : Word2Vec technique for natural language processing.
-
WordNet : get information such as Synonyms, Hypernyms and Hyponyms.
Also includes the following API's and libs:
gensim.models, NumPy, Pandas, SciPy spatial, nltk, PorterStemmer, emoji etc.
The quality of results tested against Benchmark DB, with MAP, Recall, precision, precision@5 measurements.
includes sample.parquet files.