This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
oai_harvesting_via_marcedit [2016/03/09 15:47] kdion Major updates |
oai_harvesting_via_marcedit [2016/03/16 14:56] kdion minor revision |
||
---|---|---|---|
Line 4: | Line 4: | ||
==To Harvest:== | ==To Harvest:== | ||
+ | |||
+ | Copy the OAIDCtoMARCXMLmodified.xsl file and Marc21slimUtils file from the R: drive (Theses -> OAI MarcEdit Crosswalk) to your own computer C: drive | ||
Open MarcEdit. Make sure it is set to the SAXON.NET XSLT Engine due to XSLT 2.0 being used in the XML file. (Go to Tools --> Preferences --> MarcEngine) | Open MarcEdit. Make sure it is set to the SAXON.NET XSLT Engine due to XSLT 2.0 being used in the XML file. (Go to Tools --> Preferences --> MarcEngine) | ||
Line 12: | Line 14: | ||
englmfa_theses //OR// larp_ms_projects | englmfa_theses //OR// larp_ms_projects | ||
* Metadata Type: qdc | * Metadata Type: qdc | ||
- | * Crosswalk Path: C:\Temp\XML1\OAIDCtoMARCXMLmodified.xsl | + | * Crosswalk Path: wherever it is on your C: drive (ex. C:Program Files\Crosswalk\XML1\OAIDCtoMARCXMLmodified.xsl) |
| | ||
(This is for the Qualifed Dublin Core records. Simple Dublin Core will not allow us to extract degree names nor departments.) | (This is for the Qualifed Dublin Core records. Simple Dublin Core will not allow us to extract degree names nor departments.) | ||
Line 32: | Line 34: | ||
* OAI_LARP | * OAI_LARP | ||
- | This will run the harvested records through the MarcEdit task list. Save the results to your hard drive as a .mrk file (ex: C:\Temp\OAI_Batch\MastersFeb2015.mrk) | + | This will run the harvested records through the MarcEdit task list. Save the results to your hard drive as a .mrk file (ex: C:\Crosswalk\Temp\OAIMastersFeb2015.mrk) |
==Checking for Bad Characters== | ==Checking for Bad Characters== | ||
Line 48: | Line 50: | ||
Click on the Marc Tools button and input: | Click on the Marc Tools button and input: | ||
- | __Input file__: .mrk filename as above (C:\Temp\OAI_Batch\MastersFeb2015.mrk) | + | __Input file__: .mrk filename as above (C:\Crosswalk\Temp\MastersFeb2015.mrk) |
- | __Output file__: change file type to .mm (C:\Temp\OAI_Batch\MastersFeb2015.mm) | + | __Output file__: change file type to .mm (C:\Crosswalk\Temp\MastersFeb2015.mm) |
| | ||
Select MarcMaker | Select MarcMaker | ||
Line 76: | Line 78: | ||
The original MarcEdit OAIDCtoMarcXML file can be found on your hard drive under C:\Program Files\MarcEdit 6\xslt\OAIDCtoMARCXML.xsl or wherever your MarcEdit application version is. This is the XML generic version .. don't change this; use the modified version, a copy of which can be found in the R drive under Theses\OAI MarcEdit XML harvest code (OAIDCtoMARCXMLmodified.xsl). Note that you must also have the Marc21slimUtils in the same folder in order for the .xsl file to run properly. | The original MarcEdit OAIDCtoMarcXML file can be found on your hard drive under C:\Program Files\MarcEdit 6\xslt\OAIDCtoMARCXML.xsl or wherever your MarcEdit application version is. This is the XML generic version .. don't change this; use the modified version, a copy of which can be found in the R drive under Theses\OAI MarcEdit XML harvest code (OAIDCtoMARCXMLmodified.xsl). Note that you must also have the Marc21slimUtils in the same folder in order for the .xsl file to run properly. | ||
- | The XML script is based on that generously shared by Ken Robinson (kjr106@psu.edu), Cataloging and Metadata Services, the Pennsylvania State University. This file can be found online at [[https://scholarsphere.psu.edu/collections/x346dj68d]] along with a detailed description of their eTD Dublin Core-to-MARCXML Crosswalk. | + | |
+ | The XSLT script is based on that generously shared by Ken Robinson (kjr106@psu.edu), Cataloging and Metadata Services, the Pennsylvania State University. This file can be found online at [[https://scholarsphere.psu.edu/collections/x346dj68d]] along with a detailed description of their eTD Dublin Core-to-MARCXML Crosswalk. | ||
Line 92: | Line 96: | ||
* Inserts 653 fields for keywords and such. | * Inserts 653 fields for keywords and such. | ||
* Inserts a 655_7 field (Academic theses. |2 lcgft) | * Inserts a 655_7 field (Academic theses. |2 lcgft) | ||
- | * Inserts a 690 field (Theses |x Chemistry |x Masters) *NOTE:* The crosswalk script automatically adds x Masters but this will be changed to Doctoral as needed via MarcEdit Tools.) | + | * Inserts a 690 field (Theses |x Chemistry |x Masters) *NOTE:* The crosswalk script automatically adds |x Masters but this will be changed to Doctoral as needed via MarcEdit Tools.) |
* Inserts 700 fields for advisors | * Inserts 700 fields for advisors | ||
* Inserts a 710 field (University of Massachusetts Amherst, |e degree granting institution) | * Inserts a 710 field (University of Massachusetts Amherst, |e degree granting institution) | ||
* Inserts a 710 field (University of Massachusetts Amherst. Libraries, |e issuing body) | * Inserts a 710 field (University of Massachusetts Amherst. Libraries, |e issuing body) | ||
* Inserts a 856 field (Scholarworks URL with |z Link to free resource) | * Inserts a 856 field (Scholarworks URL with |z Link to free resource) | ||
+ | |||
- | The __MarcEdit Task List__ does the following: | + | Our personalized __MarcEdit Task List__ does the following: |
| | ||
- | * Adds an 008 field and corrects any necessary LDR fields | + | * Adds an 008 field and corrects any necessary LDR fields. |
- | * Adds an 049 AUMM field | + | * Adds an 049 AUMM field. |
- | * Corrects the 100 field to include a period and comma after an initial in the author's name | + | * Corrects the 100 field to include a period and comma after an initial |
+ | in the author's name. | ||
* Inserts a colon and |b where needed | * Inserts a colon and |b where needed | ||
- | * Removes titles (Dr., Prof.) and 'Ph.D' from advisor names | + | * Removes titles (Dr., Prof.) and 'Ph.D' from advisor names. |
- | * Reverses the form of advisor names to Lastname, Firstname and replaces |e contributor with |e advisor. | + | * Reverses the form of advisor names to Lastname, Firstname and replaces |
+ | |e contributor with |e advisor. | ||
+ | * Strips unwanted HTML tags from the 520 field. | ||
+ | * Cleans up any goofy stuff (i.e., Plant & Soil Sciences to Plant and Soil Sciences) | ||
* Coming: adding a 949 field for ALEPH holdings purposes | * Coming: adding a 949 field for ALEPH holdings purposes | ||