MARCEDIT-L Archives

August 2010

MARCEDIT-L@LISTSERV.GMU.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
"Mitchell, Michael" <[log in to unmask]>
Reply To:
MarcEdit support in technical and instructional matters <[log in to unmask]>
Date:
Thu, 12 Aug 2010 16:29:47 -0500
Content-Type:
text/plain
Parts/Attachments:
text/plain (87 lines)
Does this help? To compare two columns (i.e. ISBNs) in Excel:

Assume your data starts in Cell A1 and B1

copy this formula in Column C cell C1

=IF(COUNTIF(B:B,A1)>0,"",A1)

then copy&paste down the column C

if a number is in column A, but is missing from column B, then the number will show in column C. otherwise the cell will show blank.

Michael Mitchell
Technical Services Librarian
Brazosport College
Lake Jackson, TX
michael.mitchell at brazosport.edu 

-----Original Message-----
From: MarcEdit support in technical and instructional matters [mailto:[log in to unmask]] On Behalf Of Stacy Pober
Sent: Thursday, August 12, 2010 2:34 PM
To: [log in to unmask]
Subject: Re: [MARCEDIT-L] How to find unique records

Steve McDonald wrote:
> For general information, though, here is how I would do a project like this.  The method I have seen used is to export some fields (probably title, 001, and any other fields you think might be useful to identification, like author and ISBN) from both files into tab-delimited text files.  Import both files into a spreadsheet program like Excel.  In the table for the OCLC records, add a new column and put "OCLC" into every box in the column.  For the Springer table, add a new column and put "Springer" into every box in the column.  Combine both tables into a single table, for instance by copying and pasting.  Then you sort by the title column (primary) and Springer/OCLC column (secondary).  The resulting table should mostly alternate between matching Springer and OCLC titles.  Remove those and you are left with only the titles in one or the other.

Steve,

Thank you for the help.

Just to clarify:  Are you saying I should find them in the spreadsheet
by visually scanning for the non-alternating titles, or is there some
way to automate this?  The visual scanning method is going to be
time-consuming for such a large set of books. I wouldn't mind if it
were just a few hundred titles, but it's over 17,000, which will make
it a bit of a chore.

Stacy


On Thu, Aug 12, 2010 at 1:49 PM, McDonald, Stephen
<[log in to unmask]> wrote:
> Stacy Pober said:
>> We get a large collection of Springer e-books.  The MARC records
>> supplied by Springer are pretty bad.  They now offer records via OCLC
>> which should be better quality.
>>
>> Here's the problem:  119 records are missing from the OCLC set.
>>
>> If  I concatenate our Springer-provided MARC record files into one
>> file, can I then compare that to the OCLC file and pull out a set of
>> the books that are unique to the Springer-provided set ?
>>
>> Could someone tell me the exact steps I'd need to go through to get
>> that file of 119 records?
>
> I have not done this myself, but I'm about to do something similar.  However, when I (briefly) looked into this myself when first loading the WorldCat Collection Set records for Springer, I found that the difference was that Springer had multiple records for separate volumes which were combined in a single record on WCS.  As far as I could tell, the WCS set was in fact the complete set of books available from Springer.
>
> For general information, though, here is how I would do a project like this.  The method I have seen used is to export some fields (probably title, 001, and any other fields you think might be useful to identification, like author and ISBN) from both files into tab-delimited text files.  Import both files into a spreadsheet program like Excel.  In the table for the OCLC records, add a new column and put "OCLC" into every box in the column.  For the Springer table, add a new column and put "Springer" into every box in the column.  Combine both tables into a single table, for instance by copying and pasting.  Then you sort by the title column (primary) and Springer/OCLC column (secondary).  The resulting table should mostly alternate between matching Springer and OCLC titles.  Remove those and you are left with only the titles in one or the other.  From that you can make a file from which to do a batch search on OCLC.  After downloading the batch search results, use the extr!
 
 a !
>  information in the table (author, ISBN, etc.) to remove incorrect matches.
>
>                                        Steve McDonald
>                                        [log in to unmask]
>
> ________________________________________________________________________
>
> This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit.  If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
>

-- 
Stacy Pober
Information Alchemist
Manhattan College Library
Riverdale, NY 10471
[log in to unmask]

________________________________________________________________________

This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit.  If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]

________________________________________________________________________

This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit.  If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]

ATOM RSS1 RSS2