Skip to content

A simple and powerful proxy scraping and validation tool that helps you extract, validate, and manage proxies from various sources. Perfect for developers and enthusiasts who need to work with proxies in an efficient way! 🌟

License

Notifications You must be signed in to change notification settings

javadbazokar/Proxy-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Proxy Scraper 2024 πŸš€

A simple and powerful proxy scraping and validation tool that helps you extract, validate, and manage proxies from various sources. Perfect for developers and enthusiasts who need to work with proxies in an efficient way! 🌟


Features ✨

  • Scrape Proxies: Fetch proxies from a list of URLs.
  • Validate Proxies: Ensure the proxies meet specific patterns and are valid.
  • Live Tracking: Monitor success and failure counts in real-time.
  • Downloadable Results: Easily download the validated proxy list.
  • Process Control: Start, stop, and manage the scraping process with ease.

Requirements πŸ› οΈ

  • PHP: Version 7.4 or higher.
  • JavaScript: Modern browser support with ES6 compatibility.
  • Server: Apache or Nginx with write permissions enabled for the project directory.
  • Additional Tools: cURL must be enabled on your server.

Installation πŸ“¦

  1. Clone the Repository:

    git clone https://github.com/yourusername/proxy-scraper.git
  2. Navigate to the Project Directory:

    cd proxy-scraper
  3. Set Permissions: Ensure the proxies.txt file is writable:

    chmod 666 proxies.txt
  4. Start Your Server:

    • If using XAMPP:
      • Place the project folder in the htdocs directory.
      • Start Apache and MySQL from the XAMPP control panel.
    • If using cPanel:
      • Upload the project folder to your public directory.

Usage 🚦

  1. Open the project in your browser by navigating to:

    http://localhost/proxy-scraper/
    
  2. Interface Overview:

    • Click the Start button to begin the scraping process.
    • View live counts for successful and failed links, total links, and unique proxies.
    • Use the Stop button to halt the process at any time.
  3. Download Results:

    • Once the process is complete (or stopped), a Download Proxies button will appear.
    • Click it to download the proxies.txt file.

File Structure πŸ“

proxy-scraper/
β”œβ”€β”€ assets/
β”‚   β”œβ”€β”€ links.json        # Input file containing the list of URLs to scrape
β”‚   β”œβ”€β”€ script.js         # Frontend JavaScript for managing the process
β”‚   β”œβ”€β”€ style.css         # Styling for the interface
β”‚   └── proxy_count.php   # Returns the count of saved proxies
β”œβ”€β”€ index.html            # Main interface
β”œβ”€β”€ save_link.php         # Handles link validation and proxy saving
β”œβ”€β”€ proxies.txt           # Output file for validated proxies
└── README.md             # Documentation

Configuration βš™οΈ

  1. links.json: Add your list of URLs in the following format:

    {
        "links": [
            "http://example.com/proxies1.txt",
            "http://example.com/proxies2.txt"
        ]
    }
  2. save_link.php:

    • Customize the regex pattern for proxy validation if needed:
      $proxyPattern = '/^([a-zA-Z0-9.-]+):([0-9]{1,5})$/';

Contributing 🀝

  1. Fork the repository.
  2. Create a new branch:
    git checkout -b feature-name
  3. Commit your changes:
    git commit -m "Add new feature"
  4. Push to your branch:
    git push origin feature-name
  5. Open a pull request.

License πŸ“œ

This project is licensed under the MIT License.


Support πŸ’¬

For any issues, feel free to open an issue on GitHub or contact me at [email protected].

Happy Scraping! πŸŽ‰

About

A simple and powerful proxy scraping and validation tool that helps you extract, validate, and manage proxies from various sources. Perfect for developers and enthusiasts who need to work with proxies in an efficient way! 🌟

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published