Skip to content

Tweeter Serach Engine. A smart retrieval engine, using unsupervised Algorithms

License

Notifications You must be signed in to change notification settings

Berdugo1994/Tweeter-Search-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Smart Tweeter Search Engine.

  • Local Twitter Search engine including:

  • Custom build Parser to collect data from Emojis Numbers, Countries, Entities and more.

  • Smart inverted index for creating posting files.

Using 3 Main ALGORITHMS:

  • GloVe: Global Vectors for Word Representation.

  • W2V : Word2Vec technique for natural language processing.

  • WordNet : get information such as Synonyms, Hypernyms and Hyponyms.

Similarity: Cosine Similarity, TF-IDF, BM-25 and more.

Also includes the following API's and libs:

gensim.models, NumPy, Pandas, SciPy spatial, nltk, PorterStemmer, emoji etc.

The quality of results tested against Benchmark DB, with MAP, Recall, precision, precision@5 measurements.

includes sample.parquet files.

Tests

About

Tweeter Serach Engine. A smart retrieval engine, using unsupervised Algorithms

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages