Interesting question -- I need to take a look because there was a time when
I had a process written to do this because I had to take records that
recorded multiple formats and break them into their format specific parts
for a tool I was working with.
I've actually been thinking about providing a plugin to help build the type
of data HathiTrust needs to make it easier for folks looking to submit data
to HathiTrust. My guess is that this will take a few different shapes.
Regardless, I'll see if I can find the work I'd done before, I could quickly
drop it into a plug that you could use if you want.
--tr
-----Original Message-----
From: MarcEdit support in technical and instructional matters
[mailto:[log in to unmask]] On Behalf Of McDonald, Stephen
Sent: Wednesday, May 20, 2015 3:15 PM
To: [log in to unmask]
Subject: Re: [MARCEDIT-L] duplicating records
Thanks, Jim.
Yes, I think you do grasp what I'm trying to do. I have done COM scripting
with MarcEdit before. I was looking to see if there were a simpler method,
but I don't think so. The trick with the 955 field is that it includes a
unique Internet Archive ID for each record and volume, which is why I was
thinking of matching. Putting the 955 fields into the MARC file first looks
like an improvement; the script can generate duplicate records for each 955
and matching 856 field, the way you suggest. I think this will work, if
there isn't any easier method.
I may use this opportunity to try my hand at the .NET API.
Steve McDonald
[log in to unmask]
-----Original Message-----
From: MarcEdit support in technical and instructional matters
[mailto:[log in to unmask]] On Behalf Of Jim Taylor
Sent: Wednesday, May 20, 2015 2:33 PM
To: [log in to unmask]
Subject: Re: duplicating records
Assuming I understand, it would seem simpler to use the COM capabilities of
MARCEdit and script the duplication process in that fashion. If you extract
the MARC for the ones you want to submit you would extract a record, count
the number of 856(?) tags and then duplicate that record the needed number
of times and add the 955. Not sure if you need to remove all but the
relevant 856 or not. That avoids having to match up anything. If you
haven't used this option before you can find the Documentation in the Help
file under "Help for Programmers". Let me know if you have any questions
and if you don't find a better way.
Jim
-----Original Message-----
From: MarcEdit support in technical and instructional matters
[mailto:[log in to unmask]] On Behalf Of McDonald, Stephen
Sent: Wednesday, May 20, 2015 11:36 AM
To: [log in to unmask]
Subject: duplicating records
I have a feeling someone has asked about this in the past. I am trying to
work out a procedure for preparing records of Internet Archive materials for
Hathitrust.
We have submitted a number of digitized materials to the Internet Archive.
Now we want records for these materials at Hathitrust. Hathitrust has a
complicated set of documentation for submitting metadata records, but most
of it is simply specs. What it boils down to is adding a 955 field and
submitting the records as MARC-XML. The tricky part is that they want a
separate record for each URL. In our catalog, multivolume items are
represented by a single MARC record, with multiple URLs. So when we send
this to Hathitrust, we need to duplicate this metadata record and add a
different 955 field to each one.
I know that I can translate the data from Internet Archive into a
comma-separated text file containing the OCLC number, the information for
the 955 field, and the URL; one line for each URL and Internet Archive ID.
I could use MarcEdit to translate that into a MARC file if I wanted. But
using MarcEdit to merge that file with my existing records will not
duplicate the MARC records; I end up with MARC records with multiple 955
fields and multiple URLs.
What I'm hoping to find is a way to create a separate MARC record for each
line in my text file. I can probably write a script to do it, but scripting
a method for matching the OCLC number column from the text file to the MARC
file will be a pain. Can anyone think of a simpler way to do it? Is there,
for instance, a way to script a series of Merge commands? I don't see any
method in the API for merging, nor can you put a Merge into a task.
Steve McDonald
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical
and instructional support in MarcEdit. If you wish to communicate directly
with the list owners, write to [log in to unmask] To
unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical
and instructional support in MarcEdit. If you wish to communicate directly
with the list owners, write to [log in to unmask] To
unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical
and instructional support in MarcEdit. If you wish to communicate directly
with the list owners, write to [log in to unmask] To
unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
|