Synch uploaded jsonl files to HuggingFace

Synch uploaded jsonl files to HuggingFace #2

Workflow file for this run

.github/workflows/push-dataset.yml at 5538d27

	name: Synch uploaded jsonl files to HuggingFace

	on:
	workflow_call:
	inputs:
	datasource:
	type: string
	required: true
	db_user:
	type: string
	required: true
	db_password:
	type: string
	required: true
	db_host:
	type: string
	required: true
	workflow_dispatch: # allow manual triggering
	inputs:
	datasource:
	description: 'The datasource to process'
	type: choice
	default: all
	options:
	- all
	- agentmodels
	- aiimpacts
	- aisafety.camp
	- aisafety.info
	- ai_alignment_playlist
	- ai_explained
	- ai_safety_talks
	- ai_safety_reading_group
	- ai_tech_tu_delft
	- alignmentforum
	- arbital
	- arxiv
	- carado.moe
	- cold_takes
	- deepmind_blog
	- deepmind_technical_blog
	- distill
	- eaforum
	- eleuther.ai
	- gdocs
	- generative.ink
	- gwern_blog
	- html_articles
	- importai
	- indices
	- jsteinhardt_blog
	- lesswrong
	- markdown
	- miri
	- ml_safety_newsletter
	- openai.research
	- pdfs
	- rob_miles_ai_safety
	- special_docs
	- vkrakovna_blog
	- yudkowsky_blog
	- xmls

	jobs:
	generate-dataset:
	runs-on: ubuntu-latest

	steps:
	- name: Checkout repository
	uses: actions/checkout@v2

	- name: Setup Python environment
	uses: actions/setup-python@v2
	with:
	python-version: '3.x'

	- name: Install dependencies
	run: pip install -r requirements.txt

	- name: Generate dataset file
	env:
	ARD_DB_USER: ${{ secrets.ARD_DB_USER \|\| inputs.db_user }}
	ARD_DB_PASSWORD: ${{ secrets.ARD_DB_PASSWORD \|\| inputs.db_password }}
	ARD_DB_HOST: ${{ secrets.ARD_DB_HOST \|\| inputs.db_host }}
	ARD_DB_NAME: alignment_research_dataset
	run: python main.py generate_jsonl_files ${{ inputs.datasource }}

	- name: Setup Huggingface client
	run: pip install huggingface_hub gdown jsonlines datasets

	- name: Upload files
	run: python upload_to_huggingface.py ${{ secrets.HUGGINGFACE_TOKEN }} ${{ inputs.datasource }}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Synch uploaded jsonl files to HuggingFace #2

Workflow file

Synch uploaded jsonl files to HuggingFace #2

Jobs

Run details

Workflow file for this run