It'd be great to be able to easily re-check a pair of clones from the output directory. Based on the bookkeeping files, we should be able to get the file paths of the clones.
Even more awesome would be to generate a link to the files on GitHub based on metadata (like default_branch, namespace, reponame) from the dataset (if it's a GitHub dataset).
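A minimal sketch of the link generation, assuming the dataset exposes `namespace`, `reponame` and `default_branch` per project and that the file path is relative to the repository root. The function name `github_blob_url` is hypothetical and not part of SourcererCC:

```python
from urllib.parse import quote


def github_blob_url(namespace, reponame, default_branch, file_path):
    """Build a link to a file on GitHub from dataset metadata.

    Assumption: file_path is relative to the repository root.
    """
    # Drop a leading slash so the path joins cleanly, and percent-encode
    # characters that are not URL-safe while keeping "/" as a separator.
    path = quote(file_path.lstrip("/"), safe="/")
    branch = quote(default_branch, safe="/")
    return f"https://github.com/{namespace}/{reponame}/blob/{branch}/{path}"


if __name__ == "__main__":
    print(github_blob_url("jquery", "jquery", "master", "src/core.js"))
```

For a clone pair, this would be called once per file, using the project metadata and file path looked up for each side of the pair.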
I've written a simple script that works on my sample dataset of the top 1000 GitHub JS repos. It's in SourcererCC format. I'll generalize it when I have more time: jakubzitny@0111c6a
Sample output from it looks like this:
Some examples of JS clones: jakubzitny@5d20786735f14f5f73af4a82a6c6c90d.