So many Z39.50 servers can actually take a format request. When you check
retrieve data in Unicode in the Z39.50 client, one of the options that is
exposed is a request for you to tell ME what the default server format is --
because ME will make that request, but if it doesn't respond in that format
(which is most cases), then it knows how to convert the data. My guess is
that the server you are working with likely either just returns data in UTF8
or like LC, has two different databases, one that return UTF8 data and one
that returns MARC8 data and format will be dependent on the server you
request.
In terms of embedding character conversion beyond the UTF8 conversion into
the Z39.50 client -- I'd have to see. In theory -- it might be relatively
straight forward -- if it is, I could maybe add it here. If not, then it
would require a conversion step.
Your second question -- yes, ME has an API to convert data from one
character set to another in both the .NET and COM library. There is also an
option in the commandline tool. So, there would be scripting options.
--tr
-----Original Message-----
From: MarcEdit support in technical and instructional matters
[mailto:[log in to unmask]] On Behalf Of Bryan Baldus
Sent: Wednesday, December 16, 2015 1:38 PM
To: [log in to unmask]
Subject: [MARCEDIT-L] Automated character conversion tools
After retrieving a set of records via Z39.50, I noticed that the characters
were encoded in something other than MARC8 (looking at the raw MARC in a
text editor shows Windows-style composed characters, such as "Kairós"
(instead of {acute}o) and "©" (instead of {copy})). I figure they are likely
in UTF8/Unicode, despite LDR/09 being coded blank rather than a. Using MARC
Tools->Tools->Character Conversion Tools, I converted from UTF8 to MARC8 and
the diacritics seemed to change to their normal (for one used to using
MARC8) appearance.
I'm guessing there is probably nothing I can do to ask the Z39.50 server to
send MARC8 records, so we'll need to deal with converting the format once
the records have been downloaded. Does MarcEdit offer this functionality
through an API (or some other way that doesn't require direct human
interaction with the files)? If so, how would this be implemented?
Thank you for your assistance,
Bryan Baldus
Senior Cataloger
Quality Books Inc.
The Best of America's Independent Presses
1-800-323-4241x402
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical
and instructional support in MarcEdit. If you wish to communicate directly
with the list owners, write to [log in to unmask] To
unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
[log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
|