Review the affiliation mechanism
- Since version 0.8.14, the affiliation is based on the `short name` defined by `inspirehep` for each institute (`110__t` and `110__u`).
- The mechanism looks for authors with a given affiliation using the institute `short name`. Here, we assumed that the `short name` is used by both `inspirehep.net` and `cds.cern.ch` to define an author's affiliation.
- Several exceptions have been found in `inspirehep.net`:
  - https://inspirehep.net/record/1409292
  - https://inspirehep.net/record/1391152
  - https://inspirehep.net/record/1421141
  - https://inspirehep.net/record/1421133
  - https://inspirehep.net/record/1318575
  - https://inspirehep.net/record/1326994
  - http://inspirehep.net/record/1318882
  - http://inspirehep.net/record/1420148
- as well as in `cds.cern.ch`:
- In addition, we learned that `cds.cern.ch` will switch to a CERN identifier tied to its own database of institutes.
- Finally, there is the case in which authors come from different institutes, e.g. LPC CAEN and ENSICAE, but belong to the same entity.
- In the author MARC fields, the author's affiliation is defined by the field `(100)700__u`. Another field exists, `(100)700__v`, which is left free to the cataloguer but, in most cases, contains the full name (address) of the institute.
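To make the two subfields concrete, here is a minimal Python sketch that reads `u` and `v` from the `100` and `700` datafields of a MARCXML record. The sample record, its values and the helper name `author_affiliations` are illustrative assumptions, not taken from a real notice or from the project code. The sketch also assumes a record without XML namespace and keeps only the first `u` and `v` of each author.

```python
import xml.etree.ElementTree as ET

# Hypothetical MARCXML fragment: author names and affiliations are made up.
SAMPLE = """
<record>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Doe, J.</subfield>
    <subfield code="u">Example Lab</subfield>
    <subfield code="v">Example Laboratory, 1 Example Street, Exampleville</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Roe, R.</subfield>
    <subfield code="u">Other Inst.</subfield>
  </datafield>
</record>
"""


def author_affiliations(marcxml):
    """Return (author, u, v) tuples for the 100 and 700 fields of a record.

    A missing subfield is returned as an empty string.
    """
    root = ET.fromstring(marcxml.strip())
    rows = []
    for field in root.iter("datafield"):
        if field.get("tag") not in ("100", "700"):
            continue
        # Keep the first occurrence of each subfield code (simplification).
        sub = {}
        for s in field.findall("subfield"):
            sub.setdefault(s.get("code"), (s.text or "").strip())
        rows.append((sub.get("a", ""), sub.get("u", ""), sub.get("v", "")))
    return rows
```

In this sample the second author has no `v` subfield, which is the situation where only the short name is available for matching.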
PROPOSAL
- Add a new database table `affiliations` containing two columns, `short name` and `full name`. The first one would match the content of the field `700__u` while the second one would match `700__v`. One or the other can be undefined, but not both.
- For each author, build the affiliation as `700__u+700__v`.
- Build a regular expression from the content of the `affiliations` table. Something like: `^short_name1$|^full_name2$|^short_name3full_name3$|...`
- Scan the author list to find a match.
- In order to facilitate the construction of the `affiliations` table, a wizard has to be developed. It will rely on:
  - the `inspirehep` notice
  - a list of authors belonging to the lab
  - the
- The wizard will propose a list of `(short name, full name)` pairs by querying `cds.cern.ch` and `inspirehep.net`. Meaningful pairs will be selected by the user.
- The wizard can be called at any time to add more values to the `affiliations` table.
- In principle, this approach is very general and should cover all cases.
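The table, the concatenated affiliation, the regular expression and the scan could fit together as in the following Python sketch. It is only an illustration under stated assumptions: it uses the standard `sqlite3` module rather than the project's actual database layer, the column names `short_name` and `full_name` stand in for `short name` and `full name`, and the function names `create_affiliations_table`, `affiliation_key`, `build_pattern` and `find_lab_authors` are hypothetical.

```python
import re
import sqlite3


def create_affiliations_table(conn):
    """Create the two-column table described in the proposal."""
    # The CHECK encodes "one or the other can be undefined but not both",
    # with "undefined" stored as NULL (an assumption of this sketch).
    conn.execute(
        """
        CREATE TABLE affiliations (
            short_name TEXT,
            full_name  TEXT,
            CHECK (short_name IS NOT NULL OR full_name IS NOT NULL)
        )
        """
    )


def affiliation_key(u, v):
    """Build the affiliation as 700__u + 700__v; a missing part counts as empty."""
    return (u or "") + (v or "")


def build_pattern(conn):
    """Compile one anchored alternation from all rows of the table."""
    rows = conn.execute(
        "SELECT short_name, full_name FROM affiliations"
    ).fetchall()
    if not rows:
        # An empty table must match nothing: (?!) always fails.
        return re.compile(r"(?!)")
    # Escape each entry so that dots, parentheses, etc. are taken literally.
    alternatives = [
        "^" + re.escape(affiliation_key(short, full)) + "$"
        for short, full in rows
    ]
    return re.compile("|".join(alternatives))


def find_lab_authors(authors, pattern):
    """Return the names of authors whose affiliation matches the pattern.

    `authors` is an iterable of (name, u, v) tuples.
    """
    return [
        name for name, u, v in authors
        if pattern.match(affiliation_key(u, v))
    ]
```

Because every alternative is anchored at both ends, an author is selected only when the concatenated affiliation equals one table entry exactly. This is why the table needs one row per observed combination (short name only, full name only, or both), which is what the wizard is meant to collect.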