MARCEDIT-L Archives

June 2017

MARCEDIT-L@LISTSERV.GMU.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Terry Reese <[log in to unmask]>
Reply To:
MarcEdit support in technical and instructional matters <[log in to unmask]>
Date:
Fri, 30 Jun 2017 14:33:24 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (80 lines)
The &#x12D; is a correct representation of UTF8 characters not represented
in the MARC8 character set.  This is how LC provides backwards compatibility
between UTF8 and MARC8.  Your ILS should support the syntax -- this has been
documented for almost 10 years now.  You can sometimes substitute these
values for other MARC8 codes, but MarcEdit doesn't do that for you.  It does
a direct character to character translation.

--tr

-----Original Message-----
From: MarcEdit support in technical and instructional matters
[mailto:[log in to unmask]] On Behalf Of Donnelly, Elaine, R
Sent: Friday, June 30, 2017 2:06 PM
To: [log in to unmask]
Subject: [MARCEDIT-L] Alexander Street Press diacritics problem

Is anyone out there using MarcEdit to edit MARC records supplied by
Alexander Street Press? I'm downloading records from their website, not from
OCLC.

Alexander Street Press supplies the records in UTF-8 format but our ILS
database is in MARC-8 so I have to convert them. I can't tell if the source
file contains bad data or the UTF8-to-MARC8 conversion is failing. I get
stuff like this, where some of the diacritics convert correctly and some
don't:

=100  1\$aShostakovich, Dmitri&#x12D; Dmitrievich,$d1906-1975,$ecomposer.
=245  00$aString Quartets /$cJan{acute}a&#x10D;ek, Szymanowski.
=511  0\$aCzech Chamber Philharmonic ; Vojt&#x11B;ch Spurn{acute}y,
conductor.

These appear all over the record, in headings, in titles, in contents notes.
I get this whether I convert the file before I make other edits or after,
creating a separate converted file. Sometimes the string corresponds to
spaces, dashes, quotation marks or other marks of punctuation, as if it was
copied directly from HTML.  But the source file in UTF-8 never has these
weird strings.

I've had other people tell me that they convert ASP files to MARC-8 and
don't have this problem. Has anyone on this list encountered the problem?
Did you find a workaround?

I'm currently working on a file of 1183 bibs and there are 779 fields with
these little landmines in them. So, not a small problem.

Thanks, Elaine

Elaine Donnelly
Cataloging Manager, Truxal Library
Anne Arundel Community College
[log in to unmask]
410-777-2849 voice
410-777-4216 fax




  ________________________________

The information contained in this email may be confidential and/or legally
privileged. It has been sent for the sole use of the intended recipient(s).
If the reader of this message is not an intended recipient, you are hereby
notified that any unauthorized review, use, disclosure, dissemination,
distribution, or copying of this communication, or any of its content, is
strictly prohibited. If you have received this communication in error,
please contact the sender by reply email and destroy all copies of the
original message. Thank you.

________________________________________________________________________

This message comes to you via MARCEDIT-L, a Listserv(R) list for technical
and instructional support in MarcEdit.  If you wish to communicate directly
with the list owners, write to [log in to unmask] To
unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
[log in to unmask]

________________________________________________________________________

This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit.  If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]

ATOM RSS1 RSS2