This module contains simple TensorFlow models that can be used for investigating communication costs during a distributed TensorFlow computation.
The distributed communication model is implemented in parbuffer.py. To run the model and send a variable between
geeker-3 and geeker-4, run the following:
# To time the communication of a variable of size 4096 between geeker-3 and geeker-4
# On geeker-4 (parameter server)
$ python parbuffer.py --variable_size=4096 --batch_size=100 --node_name=ps
# On geeker-3
$ python parbuffer.py --variable_size=4096 --batch_size=100 >> results
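For orientation, the sketch below shows roughly what such a two-node timing script can look like using the
TensorFlow 1.x distributed API (tf.train.ClusterSpec / tf.train.Server). It is an illustrative approximation,
not the contents of parbuffer.py; the host addresses, the port, and the single ps/worker layout are assumptions.

import argparse
import time

import tensorflow as tf

parser = argparse.ArgumentParser()
parser.add_argument("--variable_size", type=int, default=4096)
parser.add_argument("--batch_size", type=int, default=100)
parser.add_argument("--node_name", type=str, default="worker",
                    help="'ps' runs the parameter server; anything else runs the worker")
args = parser.parse_args()

# Assumed two-node cluster: geeker-4 hosts the parameter server,
# geeker-3 hosts the worker. Port 2222 is a placeholder.
cluster = tf.train.ClusterSpec({
    "ps": ["geeker-4:2222"],
    "worker": ["geeker-3:2222"],
})

if args.node_name == "ps":
    # Parameter server: serve the variable and block forever.
    server = tf.train.Server(cluster, job_name="ps", task_index=0)
    server.join()
else:
    server = tf.train.Server(cluster, job_name="worker", task_index=0)

    # The variable lives on the parameter server, so every fetch of
    # `rounded` pulls variable_size floats across the network.
    with tf.device("/job:ps/task:0"):
        y = tf.Variable(tf.random_normal([args.variable_size]))
    with tf.device("/job:worker/task:0"):
        rounded = tf.round(y)

    with tf.Session(server.target) as sess:
        sess.run(tf.global_variables_initializer())
        start = time.time()
        for _ in range(args.batch_size):
            sess.run(rounded)
        elapsed = time.time() - start
        print("variable_size=%d batch_size=%d total_time=%.6f s" %
              (args.variable_size, args.batch_size, elapsed))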
To test the baseline for the cheap operation on the variable, tf.round(y), you can time the code on a single
machine using the buffer.py script:
$ python buffer.py --variable_size=4096 --batch_size=100
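A minimal single-machine baseline along these lines might look like the following sketch (again an assumption
about the structure, not the contents of buffer.py): it times the same cheap tf.round(y) op with the variable
kept local, so no network transfer is involved.

import argparse
import time

import tensorflow as tf

parser = argparse.ArgumentParser()
parser.add_argument("--variable_size", type=int, default=4096)
parser.add_argument("--batch_size", type=int, default=100)
args = parser.parse_args()

# Variable and op both live on the local machine; this measures only
# the op and session overhead, not any communication cost.
y = tf.Variable(tf.random_normal([args.variable_size]))
rounded = tf.round(y)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    start = time.time()
    for _ in range(args.batch_size):
        sess.run(rounded)
    elapsed = time.time() - start
    print("variable_size=%d batch_size=%d total_time=%.6f s" %
          (args.variable_size, args.batch_size, elapsed))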
TODO:
- combine the scripts into one module
- add a script to parse the timing results