Skip to content

frubini/ukbsisharvester

 
 

Repository files navigation

ukbsisharvester

This script extract data from Narcis API

Dependencies

PYOAI

This project makes use of https://github.com/infrae/pyoai/, the library is directly added to this source code (commit 4800af53b2ed096a0305eed1d6710138a65eabcd 8/3/22)

Installation :

  1. Clone the repository
  2. install pip end pipenv
  3. cd /you/project/dir/
  4. run pipenv install
  5. run pipenv shell
  6. Configure the relevant options in the config.py file
  7. Run the harvester

Running the harvester

You can run the harvest task by modifying the main() function of harvest.py and then run python harverst.py You can run one of the scripts without arguments, or modify the script to change the start date for the harvesting process:

  • harvest_initial.py: to run the initial harvest until today @00:00, you can modify the start date in the script
  • harvest_daily.py: to run the harvest only from yesterday
  • harvest_count.py: count number of available record per day and write them to a csv
  • count_narcis_totals.py: it extracts the amount of articles from the filter menu Narcis website (only document type articles )

Customization

if you need different metadata modify the function: get_client() in harvest.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 90.9%
  • HTML 7.5%
  • CSS 1.6%