Merged
61 commits
2a53d1c
Passes single prompts through.
finbarrtimbers Aug 30, 2025
15a8e3f
Updated queue_types.py to match.
finbarrtimbers Aug 30, 2025
8c24d1b
Fixed issue with code.
finbarrtimbers Aug 30, 2025
f38f62c
Fixed queue sizing issue.
finbarrtimbers Aug 30, 2025
fc6dbf7
Updated length of tool use experiment.
finbarrtimbers Aug 30, 2025
68037ea
Merge branch 'main' into single-prompts
finbarrtimbers Aug 30, 2025
1e4f121
Merge branch 'main' into single-prompts
finbarrtimbers Aug 31, 2025
43eb277
Merge branch 'main' into single-prompts
finbarrtimbers Sep 1, 2025
66757f0
Merged conflicts
finbarrtimbers Sep 1, 2025
38ebe73
Corrected inference batch size calculation.
finbarrtimbers Sep 2, 2025
f25af3e
Fixed merge errors.
finbarrtimbers Sep 3, 2025
5702134
Undid changes to test file.
finbarrtimbers Sep 3, 2025
aef0e6b
Undid changes.
finbarrtimbers Sep 3, 2025
b4430d9
Cleaned up code
finbarrtimbers Sep 3, 2025
8f5523e
Another attempt to fix the dataset_index bug.
finbarrtimbers Sep 3, 2025
cc95a2f
Another attempt to fix the dataset_index bug.
finbarrtimbers Sep 3, 2025
47ddbbd
Added assert statements.
finbarrtimbers Sep 3, 2025
0f1999a
Removed debugging code.
finbarrtimbers Sep 3, 2025
05eab66
Fewer changes
finbarrtimbers Sep 3, 2025
c92b986
Undid changes
finbarrtimbers Sep 3, 2025
2c89ff4
Cleaned up PR.
finbarrtimbers Sep 3, 2025
aadd7d4
Fixed change in sort order
finbarrtimbers Sep 3, 2025
c2152ca
Clean up PR
finbarrtimbers Sep 3, 2025
4b604c5
Merge branch 'main' into single-prompts
finbarrtimbers Sep 3, 2025
67d3cca
Cleaned up code.
finbarrtimbers Sep 3, 2025
766e74d
Cleaned up PR.
finbarrtimbers Sep 3, 2025
d6e254b
Fixed issue.
finbarrtimbers Sep 3, 2025
69a5328
Merge branch 'main' into single-prompts
finbarrtimbers Sep 3, 2025
00e2616
Fixed linter errors.
finbarrtimbers Sep 3, 2025
75d04ed
Updated tool_grpo_fast.sh to use new workspace.
finbarrtimbers Sep 3, 2025
209873b
Removed redundant test.
finbarrtimbers Sep 3, 2025
f72ae74
Added back whitespace.
finbarrtimbers Sep 3, 2025
34b4a84
Ran linter.
finbarrtimbers Sep 3, 2025
fe995e7
Refactored code.
finbarrtimbers Sep 3, 2025
1d78345
Cleaned up PR.
finbarrtimbers Sep 3, 2025
7abc342
Fixed linter error.
finbarrtimbers Sep 3, 2025
f5764bd
Removed logging.
finbarrtimbers Sep 3, 2025
e86e719
Removed logging statement.
finbarrtimbers Sep 3, 2025
07028b2
Attempt to fix mask mismatch issue.
finbarrtimbers Sep 4, 2025
68ab970
Tests should pass now.
finbarrtimbers Sep 4, 2025
dc3ffdf
Updated timing code.
finbarrtimbers Sep 4, 2025
c0a005d
Ran linter.
finbarrtimbers Sep 4, 2025
b3f8b5d
Added timing.
finbarrtimbers Sep 4, 2025
f33382b
Timing is fast now.
finbarrtimbers Sep 4, 2025
6dbae0e
Remove timing instrumentation code
finbarrtimbers Sep 4, 2025
6f05e37
Added lots of debugging statements.
finbarrtimbers Sep 4, 2025
66ab15d
Ran linter. Fixed bug.
finbarrtimbers Sep 5, 2025
7d600ec
Added test file
finbarrtimbers Sep 5, 2025
933ca5a
Removed whitespace
finbarrtimbers Sep 5, 2025
89e7b7b
Updated script.
finbarrtimbers Sep 5, 2025
ee32186
Cleaned up code.
finbarrtimbers Sep 5, 2025
1fa7f78
Merge branch 'main' into single-prompts
finbarrtimbers Sep 5, 2025
fcd6e1a
Removed debugging code.
finbarrtimbers Sep 5, 2025
a367ba3
Fixed failing test.
finbarrtimbers Sep 5, 2025
ffc600b
Set timeout for tests. They should take 5 minutes to run.
finbarrtimbers Sep 5, 2025
35ad725
Now, tests should pass.
finbarrtimbers Sep 5, 2025
7c4d260
now tests should pass.
finbarrtimbers Sep 5, 2025
6c47405
Linter passes.
finbarrtimbers Sep 5, 2025
78b2df2
now, tests should pass
finbarrtimbers Sep 5, 2025
ea1dbdc
now, tests should actually pass
finbarrtimbers Sep 5, 2025
cdb3af3
now, tests should actually pass
finbarrtimbers Sep 5, 2025
31 changes: 16 additions & 15 deletions .github/workflows/tests.yml
@@ -11,13 +11,14 @@ jobs:
  unit-tests:
    name: Run unit tests
    runs-on: ubuntu-latest
    timeout-minutes: 20
    steps:

      - name: Delete huge unnecessary tools folder
        run: |
          echo "Disk space before cleanup:"
          df -h /

          # Remove unnecessary directories
          DIRS_TO_REMOVE=(
            "/opt/hostedtoolcache"
@@ -26,25 +27,25 @@ jobs:
            "/usr/local/lib/android"
            "/usr/local/share/boost"
          )

          for dir in "${DIRS_TO_REMOVE[@]}"; do
            [ -d "$dir" ] && sudo rm -rf "$dir" && echo "Removed: $dir"
          done

          # Remove unnecessary packages
          PACKAGES_TO_REMOVE=(
            '^aspnetcore-.*' '^dotnet-.*' '^llvm-.*' 'php.*'
            '^mongodb-.*' '^mysql-.*' azure-cli google-chrome-stable
            firefox powershell mono-devel
          )

          sudo apt-get remove -y "${PACKAGES_TO_REMOVE[@]}" 2>/dev/null || true
          sudo apt-get autoremove -y
          sudo apt-get clean

          # Clean up Docker
          docker system prune -af --volumes

          echo "Disk space after cleanup:"
          df -h /
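
The guarded removal loop in the cleanup step can be tried locally without touching real system paths. The following is a minimal sketch under stated assumptions: `remove_dirs` and the `RM_CMD` variable are hypothetical names introduced here for illustration, they are not part of the workflow, and the sketch uses plain `rm` on a scratch directory where the workflow uses `sudo rm -rf`.

```shell
set -eu

# Hypothetical helper (not in the workflow): remove each existing directory
# passed as an argument and report it; missing directories are skipped.
# RM_CMD is an assumed override so the same loop could run as "sudo rm".
remove_dirs() {
  for dir in "$@"; do
    if [ -d "$dir" ]; then
      ${RM_CMD:-rm} -rf "$dir"
      echo "Removed: $dir"
    fi
  done
}

# Dry run against a scratch directory instead of /opt or /usr/local.
scratch="$(mktemp -d)"
mkdir -p "$scratch/hostedtoolcache" "$scratch/android"
remove_dirs "$scratch/hostedtoolcache" "$scratch/android" "$scratch/does-not-exist"
```

Using an explicit `if` rather than a `[ -d ... ] && ...` chain keeps the loop's exit status at zero when the last directory is missing, which can matter if the loop is ever the final command of a script run with `set -e`.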

@@ -53,33 +54,33 @@ jobs:
          # Install only the minimal CUDA toolkit needed for nvcc
          sudo apt-get update
          sudo apt-get install -y wget software-properties-common

          # Add NVIDIA package repositories
          wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
          sudo dpkg -i cuda-keyring_1.1-1_all.deb
          rm cuda-keyring_1.1-1_all.deb

          # Install only cuda-nvcc package (much smaller than full toolkit)
          sudo apt-get update
          sudo apt-get install -y cuda-nvcc-12-6

          # Clean up apt cache to save disk space
          sudo apt-get clean
          sudo rm -rf /var/lib/apt/lists/*

          # Clean up apt cache to save disk space
          sudo apt-get clean
          sudo rm -rf /var/lib/apt/lists/*

          # Create symlink so nvcc is available
          ln -s /usr/local/cuda-12.6/bin/nvcc /usr/bin/nvcc || true

          echo "nvcc installed, checking version:"
          nvcc --version || echo "nvcc not found"

          echo "Disk space after CUDA install and cleanup:"
          df -h

      - name: Checkout code
        uses: actions/checkout@v4
      - name: Install uv
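
A note on the nvcc symlink step in the diff above: `ln -s ... || true` hides any failure, for example when the target directory is not writable by the current user, and the later `nvcc --version || echo "nvcc not found"` only prints a message. The sketch below shows one way to make the link step fail visibly. It is an illustration under stated assumptions, not the workflow's code: `link_tool` and `LN_CMD` are hypothetical names, and the demo links a fake `nvcc` inside a scratch directory rather than `/usr/bin`.

```shell
set -eu

# Hypothetical helper (not in the workflow): link SRC to DEST only when SRC
# exists, replacing any stale link, and report a missing source on stderr.
# LN_CMD is an assumed override so the same call could run as "sudo ln".
link_tool() {
  src="$1"
  dest="$2"
  if [ ! -e "$src" ]; then
    echo "missing: $src" >&2
    return 1
  fi
  ${LN_CMD:-ln} -sf "$src" "$dest"
}

# Demo in a scratch directory with a stand-in for the real compiler.
demo="$(mktemp -d)"
mkdir -p "$demo/cuda/bin" "$demo/bin"
printf '#!/bin/sh\necho "fake nvcc"\n' > "$demo/cuda/bin/nvcc"
chmod +x "$demo/cuda/bin/nvcc"
link_tool "$demo/cuda/bin/nvcc" "$demo/bin/nvcc"
```

In a workflow step, a call such as `LN_CMD="sudo ln" link_tool /usr/local/cuda-12.6/bin/nvcc /usr/bin/nvcc` would stop the step when the CUDA package did not install the expected binary, instead of continuing silently.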