Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
oai_harvesting_via_marcedit [2016/03/09 15:47]
kdion Major updates
oai_harvesting_via_marcedit [2016/03/16 14:41]
kdion updating URLs
Line 4: Line 4:
  
 ==To Harvest:== ==To Harvest:==
 +
 +Copy the OAIDCtoMARCXMLmodified.xsl file and Marc21slimUtils file from the R: drive (Theses -> OAI MarcEdit Crosswalk) to your own computer C: drive 
  
 Open MarcEdit. Make sure it is set to the SAXON.NET XSLT Engine due to XSLT 2.0 being used in the XML file.  (Go to Tools --> Preferences --> MarcEngine) Open MarcEdit. Make sure it is set to the SAXON.NET XSLT Engine due to XSLT 2.0 being used in the XML file.  (Go to Tools --> Preferences --> MarcEngine)
Line 12: Line 14:
 englmfa_theses //OR// larp_ms_projects englmfa_theses //OR// larp_ms_projects
   * Metadata Type: qdc   * Metadata Type: qdc
-  * Crosswalk Path: C:\Temp\XML1\OAIDCtoMARCXMLmodified.xsl  ​+  * Crosswalk Path: wherever it is on your C: drive (ex. C:Program Files\Crosswalk\XML1\OAIDCtoMARCXMLmodified.xsl 
         ​         ​
  (This is for the Qualifed Dublin Core records. Simple Dublin Core will not allow us to extract degree names nor departments.) ​       ​  (This is for the Qualifed Dublin Core records. Simple Dublin Core will not allow us to extract degree names nor departments.) ​       ​
Line 32: Line 34:
   * OAI_LARP   * OAI_LARP
  
-This will run the harvested records through the MarcEdit task list. Save the results to your hard drive as a .mrk file (ex: C:\Temp\OAI_Batch\MastersFeb2015.mrk)+This will run the harvested records through the MarcEdit task list. Save the results to your hard drive as a .mrk file (ex: C:\Crosswalk\Temp\OAIMastersFeb2015.mrk)
            
 ==Checking for Bad Characters== ==Checking for Bad Characters==
Line 48: Line 50:
 Click on the Marc Tools button and input: Click on the Marc Tools button and input:
  
-__Input file__: .mrk filename as above (C:\Temp\OAI_Batch\MastersFeb2015.mrk) +__Input file__: .mrk filename as above (C:\Crosswalk\Temp\MastersFeb2015.mrk) 
-__Output file__: change file type to .mm (C:\Temp\OAI_Batch\MastersFeb2015.mm)+__Output file__: change file type to .mm (C:\Crosswalk\Temp\MastersFeb2015.mm)
         ​         ​
 Select MarcMaker Select MarcMaker
Line 75: Line 77:
  
 The original MarcEdit OAIDCtoMarcXML file can be found on your hard drive under C:\Program Files\MarcEdit 6\xslt\OAIDCtoMARCXML.xsl or wherever your MarcEdit application version is.   This is the XML generic version .. don't change this; use the modified version, a copy of which can be found in the R drive under Theses\OAI MarcEdit XML harvest code (OAIDCtoMARCXMLmodified.xsl). Note that you must also have the Marc21slimUtils in the same folder in order for the .xsl file to run properly. The original MarcEdit OAIDCtoMarcXML file can be found on your hard drive under C:\Program Files\MarcEdit 6\xslt\OAIDCtoMARCXML.xsl or wherever your MarcEdit application version is.   This is the XML generic version .. don't change this; use the modified version, a copy of which can be found in the R drive under Theses\OAI MarcEdit XML harvest code (OAIDCtoMARCXMLmodified.xsl). Note that you must also have the Marc21slimUtils in the same folder in order for the .xsl file to run properly.
 +
 +The MarcEdit XML Stylesheet in the R:​\Theses\Crosswalk directory explains the purpose of each task performed by our XSLT crosswalk. ​
 + 
  
 The XML script is based on that generously shared by Ken Robinson (kjr106@psu.edu),​ Cataloging and Metadata Services, the Pennsylvania State University. ​ This file can be found online at [[https://​scholarsphere.psu.edu/​collections/​x346dj68d]] along with a detailed description of their eTD Dublin Core-to-MARCXML Crosswalk. ​   The XML script is based on that generously shared by Ken Robinson (kjr106@psu.edu),​ Cataloging and Metadata Services, the Pennsylvania State University. ​ This file can be found online at [[https://​scholarsphere.psu.edu/​collections/​x346dj68d]] along with a detailed description of their eTD Dublin Core-to-MARCXML Crosswalk. ​  
 +
  
  
Line 92: Line 98:
   * Inserts 653 fields for keywords and such.   * Inserts 653 fields for keywords and such.
   * Inserts a 655_7 field (Academic theses. |2 lcgft)   * Inserts a 655_7 field (Academic theses. |2 lcgft)
-  * Inserts a 690 field (Theses |x Chemistry |x Masters) ​ *NOTE:* The crosswalk script automatically adds x Masters but this will be changed to Doctoral as needed via MarcEdit Tools.)+  * Inserts a 690 field (Theses |x Chemistry |x Masters) ​ *NOTE:* The crosswalk script automatically adds |x Masters but this will be changed to Doctoral as needed via MarcEdit Tools.)
   * Inserts 700 fields for advisors ​   * Inserts 700 fields for advisors ​
   * Inserts a 710 field (University of Massachusetts Amherst, |e degree granting institution)   * Inserts a 710 field (University of Massachusetts Amherst, |e degree granting institution)
   * Inserts a 710 field (University of Massachusetts Amherst. Libraries, |e issuing body)   * Inserts a 710 field (University of Massachusetts Amherst. Libraries, |e issuing body)
   * Inserts a 856 field (Scholarworks URL with |z Link to free resource)   * Inserts a 856 field (Scholarworks URL with |z Link to free resource)
 +         
                    
  
-The __MarcEdit Task List__ does the following:+Our personalized ​__MarcEdit Task List__ does the following:
   ​   ​
-    * Adds an 008 field and corrects any necessary LDR fields +    * Adds an 008 field and corrects any necessary LDR fields. 
-    * Adds an 049 AUMM field +    * Adds an 049 AUMM field. 
-    * Corrects the 100 field to include a period and comma after an initial in the author'​s name+    * Corrects the 100 field to include a period and comma after an initial ​ 
 +        ​in the author'​s name.
     * Inserts a colon and |b where needed     * Inserts a colon and |b where needed
-    * Removes titles (Dr., Prof.) and '​Ph.D'​ from advisor names +    * Removes titles (Dr., Prof.) and '​Ph.D'​ from advisor names. 
-    * Reverses the form of advisor names to Lastname, Firstname and replaces |e contributor with |e advisor.  +    * Reverses the form of advisor names to Lastname, Firstname and replaces 
 +           |e contributor with |e advisor.  
 +    * Strips unwanted HTML tags from the 520 field. ​       
 +    * Cleans up any goofy stuff (i.e., Plant & Soil Sciences to Plant and Soil Sciences)
     * Coming: adding a 949 field for ALEPH holdings purposes     * Coming: adding a 949 field for ALEPH holdings purposes
  
oai_harvesting_via_marcedit.txt · Last modified: 2022/05/16 19:35 by jeustis
[unknown link type]Back to top
www.chimeric.de Creative Commons License Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0