For our use case, I believe that something in the style of the UNIX iconv function would almost always do what we need. We generally just need to turn things like "æ" into "ae" or "þ" into "th."
But I also understand why it would be important to target specific languages and why a half-baked solution would be undesirable. Either way, thanks for considering my idea! If anyone has any alternate workflows that could achieve something similar to this, I'm open to suggestions.
John Jackson
Cataloging Librarian
Materials Management
Chattahoochee Valley Libraries
706-243-2823
> -----Original Message-----
> From: MarcEdit support in technical and instructional matters <MARCEDIT-
> [log in to unmask]> On Behalf Of Terry Reese
> Sent: Monday, September 18, 2023 4:54 PM
> To: [log in to unmask]
> Subject: Re: [MARCEDIT-L] Transliteration while building a new field
>
> The problem with this kind of transliteration is I'd need to note the specific
> language. MarcEdit includes transliterations for targeting specific fields -- but
> you have to pick the language as each set of transliteration rules is different
> when going between utf8 and ascii. I don't have a good way to answer this
> question at the moment.
>
> --tr
>
> -----Original Message-----
> From: MarcEdit support in technical and instructional matters <MARCEDIT-
> [log in to unmask]> On Behalf Of John Jackson
> Sent: Monday, September 18, 2023 3:05 PM
> To: [log in to unmask]
> Subject: [MARCEDIT-L] Transliteration while building a new field
>
> Hello,
>
> My library includes the first three letters of an author's name in each book's
> call number, which we usually transliterate into ASCII characters. (It's much
> easier for staff to sort books alphabetically with only 26 letters.) Is it possible
> to do such transliteration while building a new field?
>
> I'm imagining something similar to an "iconv" function. This hypothetical
> pattern:
> =092 \\$a{100$a.iconv("utf-8", "ascii//TRANSLIT").substring(0, 3)} With this
> input:
> =100 0\$aÆvar Þór Benediktsson,$d1984-$eauthor.
> Would produce this field:
> =092 \\$aAev
>
> Of course, I made up the "iconv" part of that example. Would adding such a
> feature be feasible? Or is there a workaround to achieve the same result?
> (Or am I just missing something obvious?) I'm happy to receive any advice!
>
> John Jackson
> Cataloging Librarian
> Materials Management
> Chattahoochee Valley Libraries
> 706-243-2823
>
>
> __________________________________________________________
> ______________
>
> This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and
> instructional support in MarcEdit. If you wish to communicate directly with
> the list owners, write to [log in to unmask] To
> unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
> [log in to unmask]
>
> __________________________________________________________
> ______________
>
> This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and
> instructional support in MarcEdit. If you wish to communicate directly with
> the list owners, write to [log in to unmask] To
> unsubscribe, send a message "SIGNOFF MARCEDIT-L" to
> [log in to unmask]
________________________________________________________________________
This message comes to you via MARCEDIT-L, a Listserv(R) list for technical and instructional support in MarcEdit. If you wish to communicate directly with the list owners, write to [log in to unmask] To unsubscribe, send a message "SIGNOFF MARCEDIT-L" to [log in to unmask]
|