Silicon-Superforecaster

silicon-superforecaster combines the outputs of multiple large language models (LLMs) to produce predictions and judgments across a wide range of topics. Just as the human mind can serve as a judgment instrument, silicon-superforecaster treats LLMs as measuring devices.

Mars by 2030?

Measuring: What are the chances that humans will land on Mars by 2030?

SpaceX Starship Launches in 2024?

Measuring: How many SpaceX Starship launches reach space in 2024? (1) 1 (2) 2 (3) 3 (4) 4 (5) 5+

Background

silicon-superforecaster aims to improve decision-making by aggregating the judgments of diverse LLMs, guided by the principles outlined in "Superforecasting" and "Noise." Both books favor collective intelligence over the analysis of a single expert. Here, a population of LLMs plays the role of the crowd, so each result combines many independent judgments rather than resting on one model's answer.

Example Predictions and Judgments for Silicon-Superforecaster

To showcase the range of predictions and judgments silicon-superforecaster can generate, here are some example modifications to index.ts. The examples use both Scale.Probability and Scale.Options.

Scale.Probability Examples

Politics: Evaluate the probability of Israel holding new elections by the end of 2024.

let population = new Population();
population.addAllModels(5);
measurement("Israel will hold new elections by the end of 2024", Scale.Probability, population).then((response) => {
    Summary.create(response, Scale.Probability, true /* verbose */);
});

Technology Adoption Rate: Evaluate the probability of quantum computing becoming mainstream in consumer electronics by 2030.

let population = new Population();
population.addAllModels(5);
measurement("What is the probability of mainstream adoption of quantum computing in consumer electronics by 2030?", Scale.Probability, population).then((response) => {
    Summary.create(response, Scale.Probability);
});

Environmental Goals: Assess the likelihood of meeting the Paris Agreement's global warming limit by 2050.

let population = new Population();
population.addAllModels(20);
measurement("What is the probability of achieving the Paris Agreement's goal of limiting global warming to 1.5 degrees Celsius above pre-industrial levels by 2050?", Scale.Probability, population).then((response) => {
    Summary.create(response, Scale.Probability);
});
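Once a population has answered a Scale.Probability question, the individual answers still have to be combined into one view. The sketch below shows one plausible way to do that with a mean, a median, and a standard deviation. It is an illustration only: `summarizeProbabilities` and `ProbabilitySummary` are hypothetical names, and the repository's summary.ts may compute something different.

```typescript
// Hypothetical helper, not taken from the repository: summarizes probability
// answers (each expected in [0, 1]) collected from a population of LLM judges.
interface ProbabilitySummary {
  count: number;  // number of valid answers
  mean: number;
  median: number;
  stdDev: number; // population standard deviation, a rough measure of disagreement
}

function summarizeProbabilities(answers: number[]): ProbabilitySummary {
  // Drop anything that is not a valid probability (NaN from a failed parse,
  // or values outside [0, 1]).
  const valid = answers.filter((p) => p >= 0 && p <= 1);
  if (valid.length === 0) {
    throw new Error("no valid probability answers to summarize");
  }

  const sorted = valid.slice().sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  const median =
    sorted.length % 2 === 0 ? (sorted[mid - 1] + sorted[mid]) / 2 : sorted[mid];

  const mean = valid.reduce((sum, p) => sum + p, 0) / valid.length;
  const variance =
    valid.reduce((sum, p) => sum + (p - mean) * (p - mean), 0) / valid.length;

  return { count: valid.length, mean, median, stdDev: Math.sqrt(variance) };
}
```

The median is less sensitive than the mean to a single judge that answers far from the rest, and a large standard deviation signals that the judges disagree.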

Scale.Options Examples

Future of Work: Identify which sector is poised for the greatest growth due to automation in the next decade.

let population = new Population();
population.addStrongModels(20);
measurement("Which sector will experience the most significant growth due to automation in the next decade: (1) technology (2) healthcare (3) education (4) manufacturing?", Scale.Options, population).then((response) => {
    Summary.create(response, Scale.Options);
});

Space Exploration Milestones: Predict the next major milestone in space exploration by 2030.

let population = new Population();
population.addModel(Model.GPT4, 10);
population.addModel(Model.CLAUDE_SONNET, 10);
measurement("What will be the next major milestone in space exploration by 2030: (1) returning humans to the moon (2) launching a manned mission to Mars (3) discovering extraterrestrial life (4) establishing a permanent space station?", Scale.Options, population).then((response) => {
    Summary.create(response, Scale.Options);
});
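For Scale.Options questions, the natural aggregate is a count of how many judges chose each numbered option. The following is a minimal sketch of such a tally, assuming each judge's answer has already been parsed into a 1-based option number. `tallyOptions` and `leadingOptions` are hypothetical helpers and are not taken from summary.ts.

```typescript
// Hypothetical helpers, not taken from the repository.

// Counts how many judges chose each option. Votes are 1-based option numbers,
// matching the "(1) ... (2) ..." convention used in the questions.
function tallyOptions(votes: number[], optionCount: number): number[] {
  const counts: number[] = [];
  for (let i = 0; i < optionCount; i++) {
    counts.push(0);
  }
  for (const vote of votes) {
    const isWholeNumber = Math.floor(vote) === vote; // false for NaN and fractions
    if (isWholeNumber && vote >= 1 && vote <= optionCount) {
      counts[vote - 1] += 1;
    }
    // Out-of-range or unparsable votes are ignored.
  }
  return counts;
}

// Returns the 1-based numbers of the option(s) with the most votes.
// Several numbers are returned on a tie; an empty list means nobody voted.
function leadingOptions(counts: number[]): number[] {
  const max = counts.length > 0 ? Math.max(...counts) : 0;
  if (max <= 0) {
    return [];
  }
  const leaders: number[] = [];
  counts.forEach((count, index) => {
    if (count === max) {
      leaders.push(index + 1);
    }
  });
  return leaders;
}
```

The array returned by `tallyOptions` is already in histogram form: position 0 holds the number of votes for option (1), position 1 for option (2), and so on.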

Process outline

Setup phase: The platform is initialized with the necessary configurations and models.
Calibration phase: Each model is supplied with the most recent data relevant to the question.
- The model creates a query for the chosen search tool
- The query is used to fetch the most recent data
- The model summarizes the fetched data for the purpose of judging / forecasting
Measurement phase: The calibrated models provide their predictions and judgments on the specified topic.
Summary phase: The platform synthesizes the outputs from the models into various histograms and summaries for user consumption.
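The four phases above can be sketched as a small pipeline. This is a simplified illustration under several assumptions: every name in it (`Judge`, `SearchTool`, `runForecast`, `makeStubJudge`) is hypothetical, the LLM and search calls are replaced by plain synchronous functions, and the summary phase is reduced to a mean. The real project calls remote APIs, so its equivalents are asynchronous.

```typescript
// All names below are hypothetical; they do not mirror the repository's API.
interface Judge {
  name: string;
  makeQuery(question: string): string;               // calibration, step 1
  summarize(documents: string[]): string;            // calibration, step 3
  answer(question: string, context: string): number; // measurement
}

// Stand-in for a search tool: takes a query, returns matching documents.
type SearchTool = (query: string) => string[];

interface ForecastResult {
  answers: number[];
  mean: number;
}

function runForecast(question: string, judges: Judge[], search: SearchTool): ForecastResult {
  const answers: number[] = [];
  for (const judge of judges) {
    // Calibration phase: query -> fetch -> summarize, once per judge.
    const query = judge.makeQuery(question);
    const documents = search(query);
    const context = judge.summarize(documents);

    // Measurement phase: the calibrated judge gives its answer.
    answers.push(judge.answer(question, context));
  }

  // Summary phase: here just a mean; NaN when there are no judges.
  let total = 0;
  for (const a of answers) {
    total += a;
  }
  return { answers, mean: answers.length > 0 ? total / answers.length : NaN };
}

// Setup phase helper: builds a stub judge that always returns the same answer.
function makeStubJudge(name: string, fixedAnswer: number): Judge {
  return {
    name,
    makeQuery: (question) => question.replace(/\?+$/, ""),
    summarize: (documents) => documents.join(" "),
    answer: () => fixedAnswer,
  };
}
```

Calibration runs once per judge in this sketch, so two judges backed by different models can build different queries and therefore see different context.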

Core Components

index.ts: Facilitates user interactions and manages the overall workflow of the platform.
measure.ts: Defines the framework for quantifying predictions and judgments, ensuring that outcomes from different populations of LLM judges are standardized and comparable.
measurement.ts: Handles the collection and documentation of outputs from specific populations of LLM judges, applying the standardization criteria to produce structured, comparable insights.
calibrate.ts: Runs the calibration phase for each model (currently supports Wiki and Tavily Search) to retrieve the most recent data available.
model.ts: Coordinates the input and integration of outputs from multiple LLMs.
population.ts: Describes the "population object," detailing the collective of LLM judges, including their distribution and management for particular measurements.
scale.ts: Provides mechanisms for adjusting the scope and scale of predictions for uniform analysis across different contexts.
summary.ts: Compiles the outputs into accessible summaries for users, synthesizing diverse model insights.
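To make the "population object" more concrete, here is a minimal sketch of a data structure that records how many judges each model contributes. It is an assumption-laden illustration: `SketchPopulation` is a hypothetical class, model identifiers are plain strings, and the real population.ts may be organized quite differently.

```typescript
// Hypothetical sketch; not the repository's Population class.
class SketchPopulation {
  // Number of judges per model name.
  private counts: { [model: string]: number } = {};

  // Adds `count` judges backed by the given model.
  addModel(model: string, count: number): void {
    if (Math.floor(count) !== count || count < 1) {
      throw new Error("count must be a positive whole number");
    }
    this.counts[model] = (this.counts[model] || 0) + count;
  }

  // Total number of judges across all models.
  size(): number {
    let total = 0;
    for (const model of Object.keys(this.counts)) {
      total += this.counts[model];
    }
    return total;
  }

  // Expands the counts into one entry per judge, grouped by model.
  judges(): string[] {
    const result: string[] = [];
    for (const model of Object.keys(this.counts)) {
      for (let i = 0; i < this.counts[model]; i++) {
        result.push(model);
      }
    }
    return result;
  }
}
```

Storing counts rather than a flat list keeps the distribution of models explicit, which is the part of the population that affects how a measurement should be read.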

Getting Started

To begin using silicon-superforecaster:

Clone the repository and install dependencies:

git clone <repository-url>
cd silicon-superforecaster
npm install

Create a `.env` File

Create a .env file in the root directory of the project. It should contain the following environment variables:

OPENAI_API_KEY=<your_key>
ANTHROPIC_API_KEY=<your_key>
REPLICATE_API_TOKEN=<your_key>
GOOGLE_API_KEY=<your_key>
TAVILY_API_KEY=<your_key>
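A missing or placeholder key usually only shows up as a failed API call later on, so it can help to check the environment up front. The sketch below is one way to do that. The five variable names come from the list above; `findMissingKeys` is a hypothetical helper, and treating all five keys as mandatory is an assumption, since some may only be needed for particular models or tools.

```typescript
// Variable names are taken from the .env list above; the helper itself is
// hypothetical and not part of the repository.
const REQUIRED_KEYS: string[] = [
  "OPENAI_API_KEY",
  "ANTHROPIC_API_KEY",
  "REPLICATE_API_TOKEN",
  "GOOGLE_API_KEY",
  "TAVILY_API_KEY",
];

// Returns the names of variables that are absent, blank, or still set to the
// "<your_key>" placeholder.
function findMissingKeys(env: { [name: string]: string | undefined }): string[] {
  return REQUIRED_KEYS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "" || value.trim() === "<your_key>";
  });
}
```

In a Node.js entry point this could be called as `findMissingKeys(process.env)` once the .env file has been loaded, stopping with a clear message if the returned list is not empty.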
