Skip to content

luk-kaminski/vep

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

About

Vector Embedding Playground is a tool that helps you play with Vector Embeddings (provided in a file).

Prerequisites

Needed Python libraries: numpy, matplotlib, scikit-learn.

pip3 install numpy
pip3 install matplotlib
pip3 install scikit-learn

Supported file formats

Usage

  1. Get a file with vector embeddings.
  2. Start the program: python3 vep.py [vector_embeddings_file]
  3. Type help to get the list of supported functions.
  4. Play with it!

Examples of what you can achieve

similarity_distribution

embeddings_clustering_visualisation

Clues

The files with Vector Embeddings tend to be huge. Significant portion of words are rare or even strange. You can cut only the words from the top (the most popular ones) and work on a smaller file, faster. The command for that is:

python3 vector_file_cutter.py [file] [word_limit]

i.e.:

python3 vector_file_cutter.py glove.6B.300d.txt 30000

Warning

The project is still work in progress and has some problems here and there...

About

Vector Embedding Playground

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages