Well, we'd like to get the unique records in each file, ideally
knowing which file the records came from.
The long version --
Our holdings in our state's union catalog have gotten pretty
inaccurate over the years. But because of budget cuts, the State
Library is no longer performing batch deletes/ loads.
They can, however, do an extract for us (file #1). And we can
obviously export our records from our ILS (file #2), on which we've
just done a major clean up.
My thought was to isolate the unique records in each file & then have
a volunteer add or delete the appropriate record in the union catalog.
Then our holdings will be accurate -- great for ILL and and as a
worst-case backup.
What I can probably do is just add a custom field so that we can
easily tell which file a particular record came from.
Cheers,
Cab Vinton, Director
Sanbornton Public Library
Sanbornton, NH
"Politeness and consideration for others is like investing pennies and
getting dollars back." Thomas Sowell
On Thu, Sep 29, 2011 at 2:47 PM, Reese, Terry
<[log in to unmask]> wrote:
> Do you need to compare the two files together and get the unique files or find the unique records in each individual file? If it's the unique records between the two files, you could join the two files with MARC Join, and then open the file in the MarcEditor and select the Record Deduplication tool. If you have it print the Unique items, that will give you the items without any duplicate entry.
>
> If you want the duplicates in individual files -- you should just do the above process (without the join) on each file.
>
> What strikes me is I probably should pull this function out so that you don't need to run it solely in the MarcEditor.
>
> --TR
>
> -----Original Message-----
> From: MarcEdit support in technical and instructional matters [mailto:[log in to unmask]] On Behalf Of Cab Vinton
> Sent: Thursday, September 29, 2011 11:05 AM
> To: [log in to unmask]
> Subject: [MARCEDIT-L] Comparing files
>
> I'm looking for the easiest way of isolating the unique records in two large-ish files of MARC records (about 22k).
>
> I'm anticipating that about 90% of the records will be duplicated, but need to identify which are unique to each file.
>
> Is there a way to use MarcEdit for this? Or perhaps another utility is better suited?
>
> Thanks for any help!
>
> Cab Vinton, Director
> Sanbornton Public Library
> Sanbornton, NH
>
> "Politeness and consideration for others is like investing pennies and getting dollars back." Thomas Sowell
>
> ________________________________________________________________________
>
> This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
>
> ________________________________________________________________________
>
> This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
>
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
|