Clone google searching tab, for studying purpose

MagicDinosaur/Goooly


Gooly

A Google-inspired search bar. You can try it here: http://gooly.phamvietduc.com

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

Overview

Gooly is a Google-clone search bar backed by a crawling tool on the back end. Here is a first look:

gif1

Behind the project is a tool that collects data from public websites and inserts it into a MySQL database. Here is how it looks:

Terminal

gif2

Database

image

(back to top)

Built With

(back to top)

Details

The idea behind this project

While learning web programming, I noticed that most websites, especially online news and information sites, adhere to a common standard in order to increase their number of visitors.
Inspecting these pages reveals a set of HTML meta tags that describe each page's general content, such as its title, main image, and topic. For example, the homepages of Cnet and The Verge use similarly named tags such as "og:site_name" and "og:description".

Inspecting Cnet image
Inspecting the verge image
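
As a rough illustration of reading these tags, here is a minimal sketch that pulls "og:" meta tags out of an HTML string. The function names `extractOpenGraph` and `readAttribute` and the sample HTML are hypothetical and are not taken from the Gooly source. It uses regular expressions only to stay dependency-free; a real HTML parser would likely be more robust.

```javascript
// Hypothetical helper: read one attribute value from a single tag string.
// Handles both double-quoted and single-quoted values.
function readAttribute(tag, name) {
  const pattern = new RegExp(`\\b${name}\\s*=\\s*(?:"([^"]*)"|'([^']*)')`, "i");
  const match = pattern.exec(tag);
  if (!match) return null;
  return match[1] !== undefined ? match[1] : match[2];
}

// Hypothetical helper: collect every <meta> tag whose property starts with "og:".
function extractOpenGraph(html) {
  const tags = {};
  for (const [tag] of html.matchAll(/<meta\b[^>]*>/gi)) {
    const property = readAttribute(tag, "property");
    const content = readAttribute(tag, "content");
    if (property !== null && content !== null && property.startsWith("og:")) {
      tags[property] = content;
    }
  }
  return tags;
}

// Made-up sample markup, used only for illustration.
const sampleHtml = `
<head>
  <meta property="og:site_name" content="Example News">
  <meta content="A made-up page used for illustration" property="og:description">
  <meta name="viewport" content="width=device-width">
</head>`;

const og = extractOpenGraph(sampleHtml);
console.log(og);
```

Because the attributes are read one at a time, the order of `property` and `content` inside a tag does not matter, and tags without an "og:" property (such as the viewport tag) are skipped.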

With that idea in mind, I started developing a tool that recursively follows the meta tags and anchor tags of public websites and stores the collected data in the database. The collected data is then served to the search engine.
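
The recursive collection described above could be sketched roughly as follows. This is an assumption-laden illustration, not Gooly's actual crawler: `fakeWeb` stands in for the network, the `rows` array stands in for the MySQL table, and all function names are hypothetical. A depth limit and a visited set keep the recursion from looping between pages that link to each other.

```javascript
// Hypothetical stand-in for the network: maps URLs to HTML strings.
const fakeWeb = {
  "https://a.example/":
    '<meta property="og:title" content="Page A">' +
    '<a href="https://b.example/">B</a><a href="https://c.example/">C</a>',
  "https://b.example/":
    '<meta property="og:title" content="Page B">' +
    '<a href="https://a.example/">back to A</a>',
  "https://c.example/": '<meta property="og:title" content="Page C">',
};
const fetchFromFakeWeb = (url) => fakeWeb[url] ?? null;

// Collect the href of every anchor tag (double-quoted hrefs only, for brevity).
function extractLinks(html) {
  const links = [];
  for (const match of html.matchAll(/<a\b[^>]*\bhref\s*=\s*"([^"]*)"/gi)) {
    links.push(match[1]);
  }
  return links;
}

// Read og:title; assumes the property attribute comes before content.
function extractTitle(html) {
  const match =
    /<meta\b[^>]*\bproperty\s*=\s*"og:title"[^>]*\bcontent\s*=\s*"([^"]*)"/i.exec(html);
  return match ? match[1] : null;
}

// Visit a page, record its metadata, then recurse into its links.
function crawl(url, fetchPage, maxDepth, visited = new Set(), rows = []) {
  if (maxDepth < 0 || visited.has(url)) return rows;
  visited.add(url);
  const html = fetchPage(url);
  if (html === null) return rows; // page could not be fetched
  rows.push({ url, title: extractTitle(html) }); // a real tool would INSERT here
  for (const link of extractLinks(html)) {
    crawl(link, fetchPage, maxDepth - 1, visited, rows);
  }
  return rows;
}

const rows = crawl("https://a.example/", fetchFromFakeWeb, 2);
console.log(rows);
```

Passing the fetch function as a parameter keeps the sketch runnable without network access; a real crawler would swap in an HTTP client and a database insert.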

Installation

  1. Clone the repo
    git clone https://github.com/MagicDinosaur/Goooly.git
  2. Install NPM packages
    npm install
  3. Enter your API key in config.js
    const API_KEY = 'ENTER YOUR API';

(back to top)

Roadmap

Here are some challenges I would like to take on in the near future.

  • Add image and voice search
  • Redesign database for better searching
  • Implement Natural Language Processing ML models (GPT, BERT, Transformer) to enhance the quality of search queries
  • Optimize the crawling engine (starting with asynchronous I/O)
  • Multi-language Support
    • Vietnamese
    • English

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion or feature idea that would make this better, please fork the repo and create a pull request.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Contact

Duc Pham - - [email protected]

(back to top)
