Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
rustil authored Apr 6, 2018
1 parent 3f5605b commit 0f13ad4
Showing 1 changed file with 24 additions and 52 deletions.
76 changes: 24 additions & 52 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,52 +1,24 @@
# 2018 IML workshop hackathon

[![twitter][twitter_badge]][hashtag_link]
[![mattermost][mattermost_badge]][mattermost_link]
[![indico][indico_badge]][indico_link]

## purpose of this repository

We would like the hackathon to work with little moderation from the IML
coordinators side. E.g. hacking projects should not be submitted to and then
approved by us, proposals should be visible to all participants to give
feedback, make suggestions, etc.

## how to propose a project

Our Idea is to have projects submitted as pull requests. ([How to create pull requests][prhowto])The actual content of
the pull request is not so important at this stage (just create a subdirectory
and put useful files inside) the main discussion and attraction of participants
should happen in the pull request discussion.

There is a template for pull request descriptions in place, the general idea is to describe:

- What do you want to do? What is the project about?
- Will you work on the project yourself or is it a suggestion for somebody to pick up?
- What prerequisites should/could participants bring with them? / What kind of know-how are you lacking that ideally a participant would contribute (Maybe you want to add a functionality to your favourite ML library and it would be good to know your way around its source code?)
- Are there previous works to build upon? Is there something that can be prepared in advance (set up software installation / download dataset)?

## outcome

Whatever it is your project aims for (a study, a learning experience, a new
tool, a feature to an existing tool) we would be happy if you can present
your achievement in one of the upcoming IML meetings.









[twitter_badge]: https://img.shields.io/badge/twitter-%23IMLworkshop-green.svg?style=social&logo=twitter
[tweet_link]: https://twitter.com/intent/tweet?hashtags=IMLworkshop
[hashtag_link]: https://twitter.com/hashtag/IMLworkshop

[mattermost_badge]: https://img.shields.io/badge/chat-mattermost-ff66cc.svg
[mattermost_link]: https://mattermost.web.cern.ch/iml/channels/hackathon

[indico_link]: https://indico.cern.ch/e/IML2018
[indico_badge]: https://img.shields.io/badge/agenda-indico-blue.svg?logo=data%3Aimage%2Fpng%3Bbase64%2CiVBORw0KGgoAAAANSUhEUgAAABQAAAAUCAMAAAC6V%2B0%2FAAAABGdBTUEAALGPC%2FxhBQAAACBjSFJNAAB6JgAAgIQAAPoAAACA6AAAdTAAAOpgAAA6mAAAF3CculE8AAABX1BMVEUAKTkAKDgAIjIAHS4AGywAITIAJzgAJTYBKTpngImzv8TO1tm5xchyiJIPNUQAJDUAJjZMaXTw8vP09fbDzdGfrrW5xcnx9PReeIEELTwCKzqTpauMnqYTOUgAGSoAHC0HLz98kZn4%2Bvp4jpYAJzcAHi8RN0YIMD8AIDEAIjMWO0oAIzMAJTVXcn3%2F%2F%2F8VOknJ0tb6%2Bvv2%2BPhAXmtAX2vm6uv5%2Bvrp7O5Oa3YAITGotrzd4uXg5efX3N9Sbnh6j5g%2FXmu1wMVJZ3KltLn7%2B%2Fw5WWUxUl78%2FPwxU19MaHQQNkYAGClwh5Dz9fYDKzvN1dmaq7BVcHsAJDRPbHYAFieruL3Y3uEAECIAGSnH0dWdrbMFLT3o7O3J0tU5WGZ2jJWOoac1VmKQo6n7%2FPxEYm4hRVPb4OKAlZ0nSlfs7vDz9PZjfIYAHzAiRlMXPEsBKjoELDsAIDACKzuVpq1NaXTXWV2ZAAAAAWJLR0QuVNMQhwAAAAlwSFlzAAALEwAACxMBAJqcGAAAAAd0SU1FB%2BIBDBElAvv3%2Fq0AAADUSURBVBjTY2AgFTAyMbMws7KhiLFzcHJx8%2FDy8SOJCQgKCYuIigmLS0giBKWkZWTl5FkUFJWUYWpVVNXUNTT5tbRZdXT1BLUhYvoGhkbGJqZm5hYallbWrGB7GWxs7ewd9BydnF1cWd3cPQSAgvyeel4a3j6%2Bfqxy%2FgEamnqBQUDBoGA9dc2Q0LBwnYjIKJggY3RMbFy8XnxCYlJyiiVUO4NAalqyUnpGZnJWdg7MIqAoa26ePH9%2BRkEhwklAE4qKgY5nQXE8UG0JFm9iDRDsQUcEAAB4BSW68VhpGAAAACV0RVh0ZGF0ZTpjcmVhdGUAMjAxOC0wMS0xMlQxNzozNzowMiswMTowMKC%2F%2B60AAAAldEVYdGRhdGU6bW9kaWZ5ADIwMTgtMDEtMTJUMTc6Mzc6MDIrMDE6MDDR4kMRAAAAAElFTkSuQmCC

[prhowto]: https://root.cern.ch/creating-pull-request
# ROOT-C++-Python - Benchmarking, comparing, best practices

## problem to solve
More and more people see benefits in using ML techniques and in addition to that (or apart from that) they see the benefits from taking advantage of the large data science ecosystem around (scipy, numpy, pandas, matplotlib and many many more) in addition to their ROOT based analyses. But instead of then using these tools there seems to be a high level of caution mainly due to:

- people are not necessarily aware of easy ways to connect ROOT based data with python data science tools
- people fear that using python will be _significantly_ slower than the ROOT based approach
- strongly connected to that: people are not necessarily aware of how to parallelise in python

## desired outcome
The best case scenario would be to come out of this hackathon with a comprehensive but simple presentation (mini-tutorial), that shows best practices on how to integrate non-ROOT-tools in an overall ROOT based analysis, how to transfer data between the ecosystems and which includes some performance comparisons between the different approaches.
So basically a talk that can be used to mitigate the fears of stepping outside of a purely ROOT based analysis and that gives actual starting points on how to do that.

I think that the workload of this project would be threefold:
- actually compute some performance comparisons
- search for performance comparisons, tutorials, talks about the topic that already exist.. and add them to the repository (for the latter see e.g. https://github.com/ChristosChristofidis/awesome-deep-learning )
- compile a talk (maybe a notebook, maybe something else) with a high pedagogical value :)

## skills / knowledge needed (for the project, not per person)
- didactic skills
- literature research skills
- some programming skills
- ROOT
- Other data storage solutions

0 comments on commit 0f13ad4

Please sign in to comment.