These components are supposed to be dependent to packages from my previous repositories.
Use this command to build and deploy the containers:
sudo docker-compose up -d
The folder env contains environment variables for each supported online social network.
Depending on your implementation, there are variables that are global (like TwitterCredentials) and there are some that are specific for a service.
These variable names are shown in extractors.json and initialised inside constants.py in order to be used in a generic way and avoid to mess around inside the code.
- To use the variables in a large scaled extraction, you should initialise every variable mentioned in
extractors.jsonafter launching theshared contextsubsystem.
- Proxy server.
- Service choice depending on the query model.
- Submit tasks to workers.
- Choreographed extraction using context.
- Extraction template.
- Sending data to data transformers.
- Environment variables to initialise the extraction.
- Image Extraction.
- Youtube video downloading.
- Containerisation.
- Automation of deployment. (docker-compose)
- Smaller footprint.
- Proxy and worker components are dependent on the context component.
- Workers depend on the proxy.