- The main goal of this project is to build a data crawler system to collect information about reputable information technology conferences worldwide.
- This information may include conference names, submission deadlines, event dates, locations, and topics, as well as details such as speaker lists, accepted papers, and registration information.
- Identify a list of websites or data sources from which you want to gather information.
- Design the structure of the database to store this information.
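A minimal sketch of such a schema, using SQLite for illustration; the table and column names here are assumptions, not a fixed specification:

```python
import sqlite3

# Illustrative schema for crawled conference records. Column names and
# types are assumptions; adapt them to the fields your sources provide.
SCHEMA = """
CREATE TABLE IF NOT EXISTS conferences (
    id          INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    deadline    TEXT,          -- paper submission deadline (ISO 8601)
    start_date  TEXT,          -- conference start date
    end_date    TEXT,
    location    TEXT,
    themes      TEXT,          -- comma-separated topic list
    source_url  TEXT UNIQUE    -- page the record was crawled from
);
"""

def init_db(path=":memory:"):
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn

conn = init_db()
conn.execute(
    "INSERT INTO conferences (name, deadline, location, source_url) "
    "VALUES (?, ?, ?, ?)",
    ("Example Conf 2025", "2025-01-15", "Hanoi, Vietnam",
     "https://example.org/conf"),
)
row = conn.execute("SELECT name, location FROM conferences").fetchone()
print(row)  # → ('Example Conf 2025', 'Hanoi, Vietnam')
```

The `UNIQUE` constraint on `source_url` gives later re-crawls a natural key for updating existing rows instead of inserting duplicates.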
- Build a program or script capable of automatically navigating websites, searching for information about conferences, and extracting data from these web pages.
- Suggested tools/libraries: Consider using tools like Scrapy or BeautifulSoup in Python, or any language suitable for your team's skills.
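In practice you would fetch pages with Scrapy or requests and parse them with BeautifulSoup; the sketch below uses only the standard library's `html.parser` on a hardcoded snippet (shaped like a hypothetical listing page) so it runs anywhere without network access:

```python
from html.parser import HTMLParser

# Stand-in for a real crawl: a hardcoded snippet shaped like a listing page.
HTML = """
<ul>
  <li class="conf"><a href="/icse">ICSE 2025</a></li>
  <li class="conf"><a href="/neurips">NeurIPS 2025</a></li>
</ul>
"""

class ConfExtractor(HTMLParser):
    """Collect the text of every <a> link (here, conference names)."""
    def __init__(self):
        super().__init__()
        self.in_link = False
        self.names = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.in_link = True

    def handle_endtag(self, tag):
        if tag == "a":
            self.in_link = False

    def handle_data(self, data):
        if self.in_link and data.strip():
            self.names.append(data.strip())

parser = ConfExtractor()
parser.feed(HTML)
print(parser.names)  # → ['ICSE 2025', 'NeurIPS 2025']
```

With BeautifulSoup the same extraction collapses to roughly `[a.get_text() for a in soup.select("li.conf a")]`; the CSS selector here is an assumption about the page structure.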
- After collecting the data, clean and filter it to remove irrelevant or duplicate records and to ensure accuracy.
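One sketch of this cleaning step: normalize whitespace, drop unusable records, and deduplicate by a case-insensitive key. The field names are assumptions about the crawled record shape:

```python
def clean(records):
    """Normalize names, drop empty records, and deduplicate."""
    seen = set()
    out = []
    for r in records:
        name = " ".join(r.get("name", "").split())  # collapse whitespace
        if not name:
            continue  # drop records with no usable name
        key = name.lower()
        if key in seen:
            continue  # skip duplicates from overlapping sources
        seen.add(key)
        out.append({**r, "name": name})
    return out

raw = [
    {"name": "  ICSE  2025 ", "location": "Ottawa"},
    {"name": "ICSE 2025", "location": "Ottawa"},  # duplicate
    {"name": "", "location": "?"},                # unusable
]
cleaned = clean(raw)
print(cleaned)  # → [{'name': 'ICSE 2025', 'location': 'Ottawa'}]
```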
- Store the collected data in a database or file for future use.
- If necessary, build a user interface to interact with the collected data.
- Some websites have anti-crawling mechanisms or rate limits on how fast they may be accessed.
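A common way to cope with rate limits is polite retrying with exponential backoff between attempts. In this sketch, `fetch` is any callable that raises on failure (e.g. on an HTTP 429 response); the simulated `flaky` fetcher is purely illustrative:

```python
import time

def fetch_with_backoff(fetch, retries=3, base_delay=0.01):
    """Call fetch(), waiting exponentially longer after each failure."""
    for attempt in range(retries):
        try:
            return fetch()
        except Exception:
            if attempt == retries - 1:
                raise  # out of retries; surface the error
            time.sleep(base_delay * (2 ** attempt))  # 0.01s, 0.02s, ...

# Simulated fetcher that fails twice before succeeding.
calls = {"n": 0}
def flaky():
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("simulated 429 Too Many Requests")
    return "page content"

result = fetch_with_backoff(flaky)
print(result)  # → page content
```

In a real crawler you would also add a fixed delay between all requests (Scrapy's `DOWNLOAD_DELAY` setting serves this purpose) and honor `robots.txt`.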
- Data on websites may change frequently, requiring regular updates.
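Regular re-crawls should update existing rows rather than insert duplicates. One way to sketch this is an SQLite upsert keyed on the source URL (assumed here to uniquely identify a conference page):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE conferences (
    source_url TEXT PRIMARY KEY, name TEXT, deadline TEXT)""")

def upsert(conn, url, name, deadline):
    """Insert a record, or refresh it if the URL was seen before."""
    conn.execute(
        """INSERT INTO conferences (source_url, name, deadline)
           VALUES (?, ?, ?)
           ON CONFLICT(source_url) DO UPDATE SET
               name = excluded.name, deadline = excluded.deadline""",
        (url, name, deadline),
    )

upsert(conn, "https://example.org/conf", "Example Conf", "2025-01-15")
# Second crawl: the deadline was extended, so the row is updated in place.
upsert(conn, "https://example.org/conf", "Example Conf", "2025-02-01")
rows = conn.execute(
    "SELECT COUNT(*), MAX(deadline) FROM conferences").fetchone()
print(rows)  # → (1, '2025-02-01')
```

`ON CONFLICT ... DO UPDATE` requires SQLite 3.24 or newer, which ships with recent Python versions.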
- Data may not be presented in an easily parsed format, e.g. dates written as free text or layouts that differ from site to site.
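Free-text dates are a typical example: each source writes them differently, so the crawler has to normalize them to one format. A minimal sketch, trying a list of known patterns (the pattern list is an assumption to be extended as new source formats appear):

```python
from datetime import datetime

PATTERNS = ["%Y-%m-%d", "%d %B %Y", "%B %d, %Y", "%d/%m/%Y"]

def normalize_date(text):
    """Return an ISO 8601 date string, or None if no pattern matches."""
    text = text.strip()
    for pat in PATTERNS:
        try:
            return datetime.strptime(text, pat).date().isoformat()
        except ValueError:
            continue
    return None  # unparseable; flag the record for manual review

print(normalize_date("15 January 2025"))   # → 2025-01-15
print(normalize_date("January 15, 2025"))  # → 2025-01-15
print(normalize_date("soon"))              # → None
```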
- This project can be beneficial for academic researchers, conference organizing entities, or individuals interested in tracking global conference events.
Note:
- Data collection from websites should adhere to copyright regulations and each website's policies (including its terms of service and robots.txt). Use of the collected information may also need to comply with legal regulations and the specific policies of each website.
References:
- CCFDDL
- LIX Polytechnique
- Link to a crawler built with Scrapy: go here