Harvester logic
The logic implements in the Automaton
class is the following
- Ask to the store, all the
record_id
satisfying the user request. - Reject
record_id
matching theorigin
field of a database entry. - Request to the store, the XML description of the publication and decode it.
- Check that the oai of the publication is defined and well formed. Recover it if it is not the case. From time to time, the
id
encoded in theoai
field is different from therecord
id. This happens when an old record is redirected to new one for obscure reasons. The record is ignore if a database entry is found with the bad OAI. - Reject temporarily publication.
- Check that authors are defined. Reject the publication if it is not the case.
- Check that my institute is in the list of the institutes signing the publication. Reject the publication if it is not the case. When the affiliation are not defined, try to recover this case, by finding the author of my institute signing the publication. This recovery procedure uses the author rescue list. Reject the record when the recovery procedure failed.
- Check that the collaboration, if defined, is well formed. Reject the publication if it is not the case
- Several check are applied depending on the publication type.
- At the end of this process, the publisher, the authors are formatted and the list of signatories of my institute extracted.