Sigh, why am I starting another side project when I already have so many that aren't finished? Oh well, here's the plan:
- ~~steal~~ acquire pictures from reddit/the internet
- perform k-means learning on them
- use the group representatives to generate new images
- profit!!
Goals:
- write k-means; it should be decently fast (a rough sketch of the core loop follows this list)
- try to use CUDA? at least multi-threading
- probably use C or "C with classes" (C++)
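Just to pin down what "write k-means" means here: one epoch is an assignment pass plus a mean update. A minimal single-threaded C sketch with squared-Euclidean distance (`kmeans_epoch` and its arguments are placeholder names, not the actual code in this repo):

```c
#include <float.h>
#include <stdlib.h>

/* One k-means epoch: assign each sample to its nearest mean, then
 * recompute each mean as the average of its members.
 * data: n_samples x dim (row-major), means: k x dim, labels: n_samples. */
static void kmeans_epoch(const float *data, float *means, int *labels,
                         int n_samples, int dim, int k)
{
    /* accumulators for the new means */
    float *sums   = calloc((size_t)k * dim, sizeof *sums);
    int   *counts = calloc(k, sizeof *counts);

    for (int i = 0; i < n_samples; i++) {
        const float *x = data + (size_t)i * dim;
        float best = FLT_MAX;
        int best_j = 0;

        /* assignment step: nearest mean by squared Euclidean distance */
        for (int j = 0; j < k; j++) {
            const float *m = means + (size_t)j * dim;
            float d = 0.0f;
            for (int p = 0; p < dim; p++) {
                float diff = x[p] - m[p];
                d += diff * diff;
            }
            if (d < best) { best = d; best_j = j; }
        }
        labels[i] = best_j;

        /* update step accumulators */
        counts[best_j]++;
        for (int p = 0; p < dim; p++)
            sums[(size_t)best_j * dim + p] += x[p];
    }

    /* recompute means (empty groups keep their old mean) */
    for (int j = 0; j < k; j++)
        if (counts[j] > 0)
            for (int p = 0; p < dim; p++)
                means[(size_t)j * dim + p] = sums[(size_t)j * dim + p] / counts[j];

    free(sums);
    free(counts);
}
```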
Steps:
- web scraper to download photos
- test the algorithm on MNIST handwritten digit samples (it's sort of the "hello world" of ML datasets); see the IDX-reading sketch below
- if this goes well we can try learning on memes
Kmemes isn't really a generative algorithm, so check out meme-gan for my attempt at a GAN.
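For the MNIST step: the raw IDX files are just a big-endian header (magic, count, rows, cols) followed by one unsigned byte per pixel, so they can also be read straight from C. A rough sketch assuming the standard `train-images-idx3-ubyte` layout (this is illustrative, not the converter script from the TODO list):

```c
#include <stdio.h>
#include <stdlib.h>
#include <stdint.h>

/* Read a big-endian 32-bit integer from the IDX header. */
static uint32_t read_be32(FILE *f)
{
    unsigned char b[4];
    if (fread(b, 1, 4, f) != 4) { perror("fread"); exit(1); }
    return ((uint32_t)b[0] << 24) | ((uint32_t)b[1] << 16) |
           ((uint32_t)b[2] << 8)  |  (uint32_t)b[3];
}

/* Load MNIST images: header is magic (0x00000803 for images), count, rows,
 * cols (all big-endian), followed by count*rows*cols pixel bytes. */
static unsigned char *load_mnist_images(const char *path, uint32_t *count,
                                        uint32_t *rows, uint32_t *cols)
{
    FILE *f = fopen(path, "rb");
    if (!f) { perror(path); exit(1); }

    if (read_be32(f) != 0x00000803) {
        fprintf(stderr, "not an IDX image file\n");
        exit(1);
    }
    *count = read_be32(f);
    *rows  = read_be32(f);
    *cols  = read_be32(f);

    size_t n = (size_t)*count * *rows * *cols;
    unsigned char *pixels = malloc(n);
    if (!pixels || fread(pixels, 1, n, f) != n) {
        fprintf(stderr, "short read\n");
        exit(1);
    }
    fclose(f);
    return pixels;
}
```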
Possibly figure out some kind of optimization for when we calculate all the distances for group membership: we can probably narrow the search by ruling out means that a sample definitely doesn't belong to.
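One standard way to do that kind of pruning is the triangle inequality: if dist(c_best, c_j) >= 2 * dist(x, c_best), then mean j provably can't be closer than the current best, so its full distance never needs to be computed. A sketch of that check, assuming a precomputed k x k matrix of distances between means (`mean_dist` and `distance` are made-up helper names, not anything in this repo):

```c
#include <math.h>
#include <stddef.h>

/* Euclidean distance between two dim-length vectors.
 * (The bound below needs a real metric, so no squared shortcut here.) */
static float distance(const float *a, const float *b, int dim)
{
    float d = 0.0f;
    for (int p = 0; p < dim; p++) {
        float diff = a[p] - b[p];
        d += diff * diff;
    }
    return sqrtf(d);
}

/* Assignment with triangle-inequality pruning: by the triangle inequality,
 * dist(x, c_j) >= dist(c_best, c_j) - dist(x, c_best), so if
 * dist(c_best, c_j) >= 2 * dist(x, c_best) then mean j can be skipped. */
static int assign_pruned(const float *x, const float *means,
                         const float *mean_dist, int dim, int k)
{
    int best_j = 0;
    float best = distance(x, means, dim);   /* distance to mean 0 */

    for (int j = 1; j < k; j++) {
        /* pruning test: mean j provably can't beat the current best */
        if (mean_dist[(size_t)best_j * k + j] >= 2.0f * best)
            continue;

        float d = distance(x, means + (size_t)j * dim, dim);
        if (d < best) { best = d; best_j = j; }
    }
    return best_j;
}
```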
TODO:
- make a Python script to convert the MNIST data into a more workable format
- make a Python script to display the output means
- implement multithreading (see the OpenMP sketch after this list)
- implement in CUDA
- scrape memes from reddit
- preprocess memes to separate images with a lot of text
- work on a batch loading system to deal with datasets larger than available RAM
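For the multithreading item: the assignment step is independent per sample, so an OpenMP `parallel for` over the outer loop is probably the lowest-effort starting point. A sketch only (placeholder names; compile with something like `gcc -O3 -fopenmp`):

```c
#include <float.h>
#include <stddef.h>
#include <omp.h>

/* Parallel assignment step: each sample's nearest mean is independent of the
 * others, so the outer loop can be split across threads with one pragma.
 * The mean-update accumulation would still need per-thread sums or atomics. */
static void assign_labels_omp(const float *data, const float *means, int *labels,
                              int n_samples, int dim, int k)
{
    #pragma omp parallel for schedule(static)
    for (int i = 0; i < n_samples; i++) {
        const float *x = data + (size_t)i * dim;
        float best = FLT_MAX;
        int best_j = 0;

        for (int j = 0; j < k; j++) {
            const float *m = means + (size_t)j * dim;
            float d = 0.0f;
            for (int p = 0; p < dim; p++) {
                float diff = x[p] - m[p];
                d += diff * diff;
            }
            if (d < best) { best = d; best_j = j; }
        }
        labels[i] = best_j;
    }
}
```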
Timing results (MNIST, 60,000 samples):

Method | real (s) | user (s) | sys (s) | epochs |
---|---|---|---|---|
Python/NumPy¹ | 9,529.991 | 9,522.624 | 9,522.624 | ~64 |
Single Threaded C -O0 | 555.550 | 551.086 | 0.967 | 74 |
Single Threaded C -O3 | 34.905 | 34.730 | 0.056 | 74 |
OpenMP | | | | |
Pthread Multithreaded | | | | |
CUDA² | 74.949 | 74.635 | 0.216 | 100 |
OpenCL? | | | | |
Intel Xeon Phi³? | | | | |
¹ The Python version crashed before finishing; I was too lazy to run it again, and it's already clear that it is MUCH slower.
² Running on a Tesla M40 @ 1.1 GHz. The GPU version uses the absolute difference instead of the squared difference, so it converges differently (see the sketch after these footnotes). The final means are very similar, but not identical. Currently it is quite a bit slower than the CPU version, but hopefully it will scale better with larger images. Also, the kernels are optimized for RGB data, so it is actually duplicating the BW inputs 3 times to work properly, thus processing 3x the amount of data. So in theory you could divide the time by 3?
³ Yes, I have one. Why? For fun.
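For footnote 2, the difference between the two metrics boils down to this (a toy illustration in plain C, not the actual kernel code): the CPU path minimizes a sum of squared differences while the GPU path sums absolute differences, so they optimize slightly different objectives and settle on slightly different final means.

```c
#include <math.h>

/* CPU version: sum of squared differences (what standard k-means minimizes). */
static float dist_sq(const unsigned char *a, const unsigned char *b, int n)
{
    float d = 0.0f;
    for (int i = 0; i < n; i++) {
        float diff = (float)a[i] - (float)b[i];
        d += diff * diff;
    }
    return d;
}

/* GPU version per footnote 2: sum of absolute differences (an L1 distance),
 * hence the slightly different convergence behavior. */
static float dist_abs(const unsigned char *a, const unsigned char *b, int n)
{
    float d = 0.0f;
    for (int i = 0; i < n; i++)
        d += fabsf((float)a[i] - (float)b[i]);
    return d;
}
```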