Fully Funded PhD Position Web Scraper with basic filter

Author: Lorenz Krause

Date: 2025-07-11

Description:

This script scrapes PhD positions from the first five pages of the website Fellowship Board. These positions are first saved in a CSV file. After that, there are two options to process the data:

Use the data_processing.ipynb Jupyter notebook to filter and format the PhD positions based on specified keywords and countries.
Use the chat_gpt_summary.py script to send the scraped data to OpenAI's GPT model for filtering using natural language.

Usage:

Navigate to the project directory:
```
 cd /path/to/PhD Scraper
```
Install the required libraries:
```
 pip install -r requirements.txt
```
Run the scraper:
```
 python3 phd_scraper.py
```
Process the data:
1. Using the chat_gpt_summary.py script:
First add a .env file with your OpenAI API key to the project directory:
```
 OPENAI_API_KEY=your_api_key_here
```
Then run the script:
```
 python3 chat_gpt_summary.py
```
Your put in your prompt in the terminal when running the script.
1. Using the data_processing.ipynb Jupyter notebook:
Open the data_processing.ipynb notebook, specify the keywords and countries to exclude at the top and run the whole notebook to filter and format the PhD positions.
Find the filtered results in the output folder (filtered_phd_positions.txt for the data_processing.ipynb notebook and gpt_summary.txt for the chat_gpt_summary.py script).

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
README.md		README.md
chat_gpt_summary.py		chat_gpt_summary.py
data_processing.ipynb		data_processing.ipynb
phd_scraper.py		phd_scraper.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fully Funded PhD Position Web Scraper with basic filter

Author: Lorenz Krause

Date: 2025-07-11

Description:

Usage:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fully Funded PhD Position Web Scraper with basic filter

Author: Lorenz Krause

Date: 2025-07-11

Description:

Usage:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages