DOUex

DOU email extractor

Extracts company emails from DOU.UA.

This application is useful if you plan to send your CV to Ukrainian IT companies.

Installing

git clone https://github.com/Cepr0/dou-ex.git
cd dou-ex
mvn package

Usage

java -jar target/dou-ex-1.1-RELEASE.jar [category] [city]

For example:

Getting all company emails from DOU:

java -jar target/dou-ex-1.1-RELEASE.jar

Getting emails of companies with Java vacancies:

java -jar target/dou-ex-1.1-RELEASE.jar java

Getting emails of companies with Java vacancies in Kiev:

java -jar target/dou-ex-1.1-RELEASE.jar java Киев

The extraction result is stored in data/dou-emails-[category]-[city]-{current date/time}.csv, encoded in UTF-8.

Result example:

1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Киев;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Харьков;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Львов;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Днепр;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Одесса;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Винница;[email protected]
1;Ciklum;https://jobs.dou.ua/companies/ciklum/offices/;Минск (Беларусь);[email protected]
2;EPAM;https://jobs.dou.ua/companies/epam-systems/offices/;Киев;[email protected]
2;EPAM;https://jobs.dou.ua/companies/epam-systems/offices/;Харьков;[email protected]
2;EPAM;https://jobs.dou.ua/companies/epam-systems/offices/;Львов;[email protected]
2;EPAM;https://jobs.dou.ua/companies/epam-systems/offices/;Днепр;[email protected]
2;EPAM;https://jobs.dou.ua/companies/epam-systems/offices/;Винница;[email protected]
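Judging by the sample above, each line appears to hold five semicolon-separated fields: a company number, the company name, the URL of its offices page, a city, and an email. The following is a minimal sketch of reading such a file for further processing. The class `DouResultParser` and the field names in `Row` are hypothetical and inferred from the sample only; they are not part of dou-ex. The sketch also assumes that no field contains a semicolon.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of dou-ex: reads the semicolon-separated result file.
public class DouResultParser {

    // One result row; the field names are assumptions inferred from the sample output.
    public static final class Row {
        public final int companyNo;
        public final String company;
        public final String officesUrl;
        public final String city;
        public final String email;

        Row(int companyNo, String company, String officesUrl, String city, String email) {
            this.companyNo = companyNo;
            this.company = company;
            this.officesUrl = officesUrl;
            this.city = city;
            this.email = email;
        }
    }

    // Splits one line into its five fields; assumes no field contains ';'.
    public static Row parseLine(String line) {
        String[] parts = line.split(";", -1); // -1 keeps trailing empty fields
        if (parts.length != 5) {
            throw new IllegalArgumentException("Expected 5 fields but got " + parts.length + ": " + line);
        }
        return new Row(Integer.parseInt(parts[0].trim()), parts[1], parts[2], parts[3], parts[4]);
    }

    // Parses all non-blank lines.
    public static List<Row> parseAll(List<String> lines) {
        List<Row> rows = new ArrayList<>();
        for (String line : lines) {
            if (!line.trim().isEmpty()) {
                rows.add(parseLine(line));
            }
        }
        return rows;
    }

    public static void main(String[] args) throws IOException {
        // The result file is UTF-8, so the charset is given explicitly.
        List<String> lines = Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8);
        for (Row row : parseAll(lines)) {
            System.out.println(row.company + " (" + row.city + "): " + row.email);
        }
    }
}
```

Pass the path of a generated CSV file as the only argument. Reading with an explicit UTF-8 charset keeps the Cyrillic city names intact regardless of the platform default encoding.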

Configuration

If you are behind a proxy, or need to change the delay between loading successive portions of data, edit these parameters in the config file config/dou.yml:

useProxy: true
proxyHost: localhost
proxyPort: 3128
# Delay between loading successive portions of data from the site (min/max), in ms
loadingDataDelay:
  - 1000
  - 3000
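The two `loadingDataDelay` values are described only as a min/max pair. One plausible reading is that the tool pauses for a random time within that range before each request, which is a common way to avoid overloading a site. The sketch below illustrates that idea under this assumption; the class `LoadingDelay` and the method `nextDelayMs` are hypothetical and not taken from the dou-ex source.

```java
import java.util.concurrent.ThreadLocalRandom;

// Hypothetical illustration of a randomized min/max delay; not the dou-ex implementation.
public class LoadingDelay {

    // Returns a random delay in milliseconds within [minMs, maxMs], both inclusive.
    public static long nextDelayMs(long minMs, long maxMs) {
        if (minMs < 0 || maxMs < minMs) {
            throw new IllegalArgumentException("Require 0 <= min <= max, got " + minMs + " and " + maxMs);
        }
        // nextLong's upper bound is exclusive, so add 1 to make maxMs reachable.
        return ThreadLocalRandom.current().nextLong(minMs, maxMs + 1);
    }

    public static void main(String[] args) throws InterruptedException {
        // The values mirror the sample config: 1000 ms minimum, 3000 ms maximum.
        long delay = nextDelayMs(1000, 3000);
        System.out.println("Pausing for " + delay + " ms before the next request");
        Thread.sleep(delay);
    }
}
```

If this reading is right, raising both values makes extraction slower but gentler on the site, and setting them equal gives a fixed delay.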