Skip to content

Computer Use framework via code writing and execution instead of the commonly used playwright versions.

Notifications You must be signed in to change notification settings

aditKadepurkar/Delphi

Repository files navigation

Delphi

Overview

Delphi allows users to speak to their computer and have it complete actions for them. It does this via function calling using either xLam or gpt-4o. The agent also has a preliminarly feature that allows it to write and execute code(so use this repo at your own risk). The goal is to be useful to humans because speech is much more vibrant and natural for humans than keyboards and mice. Especially for older people and those with disabilities, speaking with the computer is often easier than the current options. This project is not actively maintained anymore because better alternatives exist.

Setup

run pip install -r requirements.txt

How to use

Ongoing list of dependencies, but the current working product is in 'pipeline_OpenAI.py' which requires a few things. 1. You need to have ollama installed and 2. You need to put an OpenAI api key in a .env file. You should expect it to take some time to run, but once it is setup you can type a query and it should do something.

Credits

By Adit Kadepurkar, Matthew Xu

About

Computer Use framework via code writing and execution instead of the commonly used playwright versions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published