Skip to content

TheScriptGuy/molasses-masses

Repository files navigation

🍯 Project Molasses-Masses

🌐 Overview

Project Molasses-Masses is designed to provide a dynamically updated list of hosting providers from around the world. This list is based on the prefixes advertised as part of the BGP Autonomous System numbers.

Acknowledging that hosting providers themselves are not inherently problematic, some of their customers may have malicious intentions. Therefore, we adopt a "block first" approach, creating exceptions to allow legitimate traffic through as needed.

This project is currently tracking 12069 IPv4 Subnets and 1482 IPv6 Subnets.

🛠️ How It Works

The project uses a Python script (molasses_masses.py) to fetch and process IP prefixes associated with various hosting providers.

Input

  • The script reads from an input file named hosting-providers-source.txt.
  • This file should contain a list of tuples with hosting provider names and their AS numbers.

Output

The script generates output in the data directory:

  1. 📄 combined-v4.txt: A list of all IPv4 prefixes (View on Cloudflare)
  2. 📄 combined-v6.txt: A list of all IPv6 prefixes (View on Cloudflare)
  3. 📔 Source data for the day

These files are published to Cloudflare for easy access and distribution.

🚀 Usage

To run the script:

python molasses_masses.py --if hosting-providers-source.txt --od data --verbose --combined --summarize

Options:

  • --if: Input file (required)
  • --od: Output directory (required)
  • --verbose: Enable verbose logging
  • --combined: Create combined output files
  • --summarize: Summarize IP addresses

IP and Subnet Search Tool

This Python script searches for an exact match or the presence of IP addresses or subnets within files or directories containing a list of IPs/subnets. It's designed to help you quickly locate IP addresses or subnets from large datasets.

▶️ Usage

To use this script, you need to provide either a directory or a file to search, along with the IP addresses or subnets you want to search for.

Command-line Arguments

  • --directory : Directory to search for files containing IP addresses or subnets.
  • --file : Specific file to search.
  • --search : Comma-separated list of IP addresses and/or subnets to search for. (Required)
  • --verbose : Enable verbose output for detailed processing information.

Examples

Example 1: Search a Directory

To search all files in the data directory for the IP address 192.100.0.0/24, run:

python script.py --directory data --search 192.100.0.0/24

Example 2: Search a Specific File

To search a specific file, such as data/2024-09-09/global-communications-net-AS12615-v4.txt, for multiple IP addresses or subnets, run:

python script.py --file data/2024-09-09/global-communications-net-AS12615-v4.txt --search 192.100.1.0/24,100.0.0.0/8

Example 3: Search with Verbose Output

To search the data directory with verbose output, run:

python script.py --directory data --search 192.100.1.0/24 --verbose

📄 Example Output

When running the script with a search query, you might see output similar to the following:

Searching directory: data
Searching file: data/2024-09-09/global-communications-net-AS12615-v4.txt
Skipping invalid entry on line 5: InvalidSubnet/33
data/2024-09-09/global-communications-net-AS12615-v4.txt: line 10: 192.100.1.0/24
data/2024-09-09/amanah-tech-AS32489-v4.txt: line 15: 100.0.0.0/8
...

💡 Important Notes

At least one of --directory or --file must be provided. The script requires at least one input source to search.
The --search argument is required and should contain a comma-separated list of valid IP addresses or subnets.
The script skips invalid IP addresses or subnets found in files and will display a message if the --verbose flag is enabled.

🔧 Additional Information

If the --directory argument is provided, the script will recursively search through all files within the directory (excluding hidden files and directories).
If the --file argument is used, only that specific file will be searched.
The search is performed using Python's ipaddress module, which checks for both exact matches and subnet overlaps.

Add Hosting Provider Script 💻

This Python script (add_hp.py) allows you to easily add new hosting provider entries to the hosting-providers-source.txt file. It's designed to maintain a consistent format and prevent duplicate entries.

🚀 Features

  • Add new hosting providers with their AS (Autonomous System) numbers
  • Prevent duplicate AS number entries
  • Maintain consistent file format
  • Provide feedback on successful additions and total entries

📖 Usage

  1. 📂 Ensure that add_hp.py and hosting-providers-source.txt are in the same directory.

  2. 💻 Open a terminal or command prompt and navigate to the directory containing the script.

  3. ⌨️ Run the script using the following command format:

    python add_hp.py <name> <as_number>
    

    Replace <name> with the hosting provider's name and <as_number> with their AS number.

    Example:

    python add_hp.py example-host 12345
    
  4. ✅ If successful, you'll see a confirmation message and the total number of entries in the file.

⚠️ Input Validation

  • The name can contain letters, numbers, hyphens, and underscores.
  • The as_number must be a positive integer.

❌ Error Handling

The script will display error messages for the following situations:

  • Incorrect number of arguments
  • Invalid name format
  • Invalid AS number
  • Duplicate AS number in the existing file

📁 File Format

The hosting-providers-source.txt file maintains the following format:

[
("provider1", 12345),
("provider2", 67890),
...
("last-provider", 99999)
]

⚠️ Disclaimer

This project is not meant to imply that all traffic from hosting providers is malicious. It's a tool for those who prefer a cautious approach to network traffic, allowing for precise control and whitelisting as needed.

🤝 Contributing

Contributions to Project Molasses-Masses are welcome! Please feel free to submit pull requests or open issues to improve the project.

About

A repostitory of hosting providers

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages