{"id":339,"date":"2018-03-14T17:48:35","date_gmt":"2018-03-14T16:48:35","guid":{"rendered":"https:\/\/www.hsu-hh.de\/hisalt\/?page_id=339"},"modified":"2018-03-21T12:44:14","modified_gmt":"2018-03-21T11:44:14","slug":"betautf8","status":"publish","type":"page","link":"https:\/\/www.hsu-hh.de\/hisalt\/betautf8","title":{"rendered":"BETAUTF8"},"content":{"rendered":"<p>BETAUTF8<br \/>\n<strong>NAME<\/strong><br \/>\nbetautf8 &#8211; a fast, flexible beta code to unicode (utf8) file converter<\/p>\n<p>&nbsp;<\/p>\n<blockquote><p><em><strong>All Programms and Data mentioned belowed can be find in the Download Area at the bottom of the page.<\/strong><\/em><\/p><\/blockquote>\n<p>&nbsp;<\/p>\n<p><strong>SYNOPSIS<\/strong><br \/>\nbeta8utf8 input_file output_file [ -n ] [ -g or -h or -l ] [-eol=nn,nn]<\/p>\n<p>&nbsp;<\/p>\n<p><strong>DESCRIPTION<\/strong><br \/>\nBetautf8 is a fast, flexible translation program which converts beta code text files as used by PHI or TLG into unicode standard text files. Betautf8 translates all beta-coded Greek into unicode (utf8); the more common beta code symbols (brackets, diacritical marks, special symbols <abbr title=\"et cetera\">etc.<\/abbr>) are converted correctly. All unknown symbols and combinations are put out in unconverted form, except for formatting options (@nnn &amp;nnn $nnn): With these, the numeric options are simply deleted.<\/p>\n<p>Without any parameters given, the program works best with output created by Burkhard Mei\u00dfner\u00b4s\u00a0<a href=\"https:\/\/www.hsu-hh.de\/hisalt\/spitbol-programming-by-for-classicists-accessing-and-analyzing-classical-texts\">View and Find system for beta-coded text files<\/a>\u00a0<a href=\"http:\/\/hsu-hh.de\/hisalt\/viewandfind\" rel='nofollow'>(now in the PUBLIC DOMAIN)<\/a>, assuming that all lines in the files start with a 32-byte reference section which is put out in unconverted form. If your input to BETAUTF8 was not produced by V&amp;F, use the \/N or -n parameter.<\/p>\n<p>The program puts out all lines which start with a \u223c (column 0, beginning-of-line) as they are in the original file. If this is unsatisfactory,\u00a0please let me know.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>COMMAND LINE OPTIONS<\/strong><br \/>\n-n \u00a0 Specify the &#8222;no reference&#8220; option: The program normally assumes there are 32 bytes of reference information present at the beginning of each text line. With this option set, this assumption is not made. Instead, the program takes any line to begin directly with beta code symbols.<\/p>\n<p>&nbsp;<\/p>\n<p>-h -l -g \u00a0 Specify the startup language: H(ebrew) or G(reek) or L(atin).<\/p>\n<p>-eol=nn,nn \u00a0 Define end-of-line character(s).<\/p>\n<p>For ASCII files use -eol=13,10 &#8211; for Unix style files -eol=10 &#8211; for Mac file -eol=13<\/p>\n<p>&nbsp;<\/p>\n<p><strong>EXAMPLES<\/strong><br \/>\nThe following example, P.Lips. I 90 = Stud.Pal. III 118 (Hermupolis, AD 614\/615), shows how betautf8 converts beta code texts. If you have a browser or pdf viewer which is capable of utf8 (unicode), you can compare beta code input for betautf8 to the programs&#8217;s unicode output. Choose between\u00a0<a href=\"http:\/\/hsu-hh.de\/hisalt\/example-1-html\" rel='nofollow'><abbr title=\"Hypertext Markup Language\">HTML<\/abbr><\/a> Version\u00a0and<\/p>\n<div class=\"col-xs-12 downloads-item\">\n<div class=\"row border-line\">\n<div class=\"thumbnail-area\"><img decoding=\"async\" class=\"download-image\" src=\"\/wp-content\/themes\/hsu\/img\/dummy\/downloads_dummy.png\" alt=\"\" \/><\/div>\n<div class=\"text-area\"><span class=\"download-text\">Example 1 Pdf<\/span><\/div>\n<div class=\"download-area\"><a class=\"download-link\" href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/b8_ex.pdf\"><span class=\"download-name\">pdf laden<\/span><span class=\"donwload-icon\"><img decoding=\"async\" class=\"download-icon\" src=\"\/wp-content\/themes\/hsu\/img\/icons\/download_icon.png\" alt=\"download icon\" \/><\/span><span class=\"download-size\">38 KB<\/span><\/a><\/div>\n<\/div>\n<\/div>\n<p>This example, an anonymous Philipp history, may also be looked at. If your browser or pdf viewer is capable of utf8 (unicode), you can compare beta code input to unicode output. Choose between<a href=\"http:\/\/hsu-hh.de\/hisalt\/howtohtml\" rel='nofollow'>\u00a0<abbr title=\"Hypertext Markup Language\">HTML<\/abbr> <\/a>Version\u00a0and<\/p>\n<div class=\"col-xs-12 downloads-item\">\n<div class=\"row border-line\">\n<div class=\"thumbnail-area\"><img decoding=\"async\" class=\"download-image\" src=\"\/wp-content\/themes\/hsu\/img\/dummy\/downloads_dummy.png\" alt=\"\" \/><\/div>\n<div class=\"text-area\"><span class=\"download-text\">PDF File Original beta code file<\/span><\/div>\n<div class=\"download-area\"><a class=\"download-link\" href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/b8_ex2.pdf\"><span class=\"download-name\">pdf laden<\/span><span class=\"donwload-icon\"><img decoding=\"async\" class=\"download-icon\" src=\"\/wp-content\/themes\/hsu\/img\/icons\/download_icon.png\" alt=\"download icon\" \/><\/span><span class=\"download-size\">61 KB<\/span><\/a><\/div>\n<\/div>\n<\/div>\n<p>The third example, a medley of astrological, mathematical, papyrological and epigraphical texts, shows how powerful the betacode-to-unicode conversion algorithm of betautf8 is, and how much of the beta code text files is rendered correctly in unicode (utf8) by betautf8. <a href=\"http:\/\/hsu-hh.de\/hisalt\/comparebetacodeinputfile\" rel='nofollow'>Just compare the\u00a0beta code input file\u00a0<\/a>to the betautf8 output:<a href=\"http:\/\/hsu-hh.de\/hisalt\/betacodeoutput\" rel='nofollow'>\u00a0the unicode (utf8) file<\/a>. It may be easier for you to look at the output file in more common file formats:<\/p>\n<div class=\"col-xs-12 downloads-item\">\n<div class=\"row border-line\">\n<div class=\"thumbnail-area\"><img decoding=\"async\" class=\"download-image\" src=\"\/wp-content\/themes\/hsu\/img\/dummy\/downloads_dummy.png\" alt=\"\" \/><\/div>\n<div class=\"text-area\">Example 3\u00a0 .doc<\/div>\n<div class=\"download-area\"><a class=\"download-link\" href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/ex3.doc\"><span class=\"download-name\">msword laden<\/span><span class=\"donwload-icon\"><img decoding=\"async\" class=\"download-icon\" src=\"\/wp-content\/themes\/hsu\/img\/icons\/download_icon.png\" alt=\"download icon\" \/><\/span><span class=\"download-size\">48 KB<\/span><\/a><\/div>\n<\/div>\n<\/div>\n<p><strong>FILES AND DIRECTORIES<\/strong><br \/>\nThese are subject to differences depending on local installation conventions; all what you need is betautf8.exe (32-bit DOS version or\u00a032-bit Win-32 version) in a suitable place and DOS or Windows (or appropriate emulator) to execute it. To execute the system under the Linux operating system directly, install\u00a0Phil Budne&#8217;s <a href=\"http:\/\/www.snobol4.org\/\" rel='nofollow'>CSnobol4<\/a> implementation of the SNOBOL4 language\u00a0or\u00a0Dave Shield&#8217;s Linux:<a href=\"https:\/\/github.com\/hardbol\/spitbol\" rel='nofollow'>SPIBOL compiler<\/a>. If used with\u00a0Mark Emmer&#8217;s <a href=\"http:\/\/www.snobol4.com\/\" rel='nofollow'>SPITBOL386 compiler<\/a>, the program automatically exits, writing the standalone module (betautf8.exe) which can then be used independently.<\/p>\n<p><strong>INTERNET RESOURCES<\/strong><br \/>\nMain websites:<br \/>\n<a href=\"https:\/\/papyri.uni-leipzig.de\/content\/start.xml\" rel='nofollow'>Das Papyrus-Projekt Halle-Jena-Leipzig<\/a><\/p>\n<p>Snobol4\/Spitbol:<br \/>\n<a href=\"http:\/\/www.snobol4.org\/\" rel='nofollow'>Phil Budne&#8217;s SNOBOL4 resources<\/a><\/p>\n<p><a href=\"http:\/\/www.snobol4.org\/csnobol4\/\" rel='nofollow'>Phil Budne&#8217;s free CSnobol4 implementation<\/a><\/p>\n<p><a href=\"https:\/\/github.com\/hardbol\/spitbol\" rel='nofollow'>Dave Shield&#8217;s Linux:SPIBOL compiler<\/a><\/p>\n<p><a href=\"http:\/\/www.snobol4.com\/\" rel='nofollow'>Mark Emmer&#8217;s SPITBOL resources (including product and price lists for various versions of SNOBOL4 and SPITBOL)<\/a><\/p>\n<p><a href=\"http:\/\/hsu-hh.de\/hisalt\/source-code-for-snobol4-and-spitbol386\" rel='nofollow'> Source Code (for SNOBOL4 and SPITBOL386)<\/a><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/hsu-hh.de\/hisalt\/gnu-general-public-license\" rel='nofollow'>General Public License<\/a><\/p>\n<p>Binary (DOS or emulation)<\/p>\n<p>Binary (Win-32)<\/p>\n<p>Manual page (for Unix groff compatible formatters)<\/p>\n<p>Mark Emmer&#8217;s free 16-bit SNOBOL4+ interpreter<\/p>\n<p><strong>Download archives with ready-to-run files and Data above: Download is in the Download Area at the bottom of the page.<\/strong><\/p>\n<p>Linux: Contains 64bit and 32bit CSnobol4+betautf8.sno source file and the new Linux-SPITBOL-betautf8.spx version (10 times as fast!):\u00a0linux.zip<\/p>\n<p>MS-DOS, emulated DOS under Linux: 32-bit DOS extended betautf8.exe program file:\u00a0dos32exe.zip<\/p>\n<p>MS-DOS, emulated DOS under Linux: 32-bit CSnobol4+betautf8.sno source file:\u00a0dos_csno.zip<\/p>\n<p>Win32, emulated Windows under Linux: 32-bit betautf8.exe program:\u00a0win32exe.zip<\/p>\n<p>Win32, emulated Win32 under Linux: 32-bit CSnobol4+betautf8.sno source file:\u00a0win32.zip<\/p>\n<p>MS-DOS, emulated DOS under Linux: 16-bit snorun.exe+betautf8.sav save module:\u00a0dos_sav.zip<\/p>\n<p>Mac OS X: 32-bit CSnobol4+betautf8.sno:\u00a0mac_osx.bz2<\/p>\n<p>OpenSolaris: 32-bit CSnobol4+betautf8.sno:\u00a0solaris.zip<\/p>\n<p>FreeBSD: 32-bit CSnobol4+betautf8.sno:\u00a0freebsd.zip<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Additional utilities:<\/strong><br \/>\ncompose\/decompose: filters to normalize and convert Unicode (UTF-8) files. These filters read a unicode text file and convert it to either a form with all (if possible) accents intimately combined with their respective letters (precombined accents), or else completely detached from them (accents uncombined). Also included are programs to convert between UTF-8 and UTF-16 encodings.<\/p>\n<p>CONVCONC: Reformatting program useful to convert concordance files (index.v&amp;f) to UTF-8 using betautf8.<\/p>\n<p><a href=\"http:\/\/hsu-hh.de\/hisalt\/TABTOSP\" rel='nofollow'>tabtosp: replaces tabs by spaces in input files<\/a><\/p>\n<p>To selectively remove references,\u00a0refx.exe\u00a0from the V&amp;F distribution is a much more precise and flexible solution.<\/p>\n<p>DUKE: Prepares output of internet version of Duke Database of Documentary Papyri for conversion by betautf8<\/p>\n<p>Source code\u00a0<a href=\"http:\/\/hsu-hh.de\/hisalt\/scd\" rel='nofollow'>duke.sno<\/a><\/p>\n<p>MS-DOS .exe\u00a0duke.exe<\/p>\n<p>16-bit .sav\u00a0duke.sav<\/p>\n<p>32-bit Windows .exe\u00a0duke.exe<\/p>\n<p><strong>SPEED<\/strong><br \/>\nTo get an idea of how fast the different implementations are, the entire Polybius text (45381 lines, 2 621 444 bytes) was translated into unicode (45381 lines, 5 779 219 bytes), using a LapTop computer [Machine: Fujitsu Siemens ESPRIMO Mobile V5535 V1.06 with Intel(R) Celeron 540 @ 1.86GHz and 4 GBytes of RAM under SuSE Linux 11.0 with kernel 2.6.25.20-2 (x86_64)].<\/p>\n<p>Effectively, there are therefore five speed-groups: Linux-SPITBOL or betautf8.exe (32bit Windows), betautf8.exe (MS-DOS), betautf8.exe (FreeDOS), the CSnobol4 implementations, and 16-bit DOS Snobol4+. Their relative speeds are roughly like:<\/p>\n<p>100% : 65% : 50% : 15% : 0.15%<\/p>\n<p>Therefore, from the point of view of speed, betautf8.exe (either the DOS or the 32bit Windows version) and Linux-SPITBOL+betautf8.spx is the way to go. Experiments with different sets of texts from the TLG &#8222;E&#8220; (FUJITSU SIEMENS ESPRIMO Mobile V5535 laptop, SuSE 11.0 Linux) have demonstrated the translation speed of these systems to be between 5827 (dosemu+betautf8.exe) and 11615 (wine+betautf8.exe) beta code text lines per second.<\/p>\n<p><strong>LICENSING<\/strong><br \/>\nBetautf8 is distributed under the<a href=\"http:\/\/hsu-hh.de\/hisalt\/gnu-general-public-license\" rel='nofollow'>\u00a0GNU General Public License without any warranty<\/a>, its source being provided together with the running program itself. However, the author kindly asks anybody who makes additions, corrections or other suitable contributions to the program and\/or its source, to provide the author with a copy of these contributions in order to provide the community with ever improved versions of the software. To contact the author, either use his e-mail address:\u00a0bmeissne@hsu-hamburg.de<\/p>\n<p>or his university post box:<\/p>\n<p><abbr title=\"Professorin \/ Professor\">Prof.<\/abbr> <abbr title=\"Doktorin \/ Doktor\">Dr.<\/abbr> Burkhard Meissner<br \/>\nProfessur f\u00fcr Alte Geschichte<br \/>\nHelmut-Schmidt-Universit\u00e4t<br \/>\nUniversity of the Federal Armed Forces<\/p>\n<p>Holstenhofweg 85<br \/>\nD-22043 Hamburg (Germany)<br \/>\nAUTHOR<br \/>\n<a href=\"http:\/\/hsu-hh.de\/hisalt\/lehrstuhlinhaber\" rel='nofollow'>Betautf8 has been written by\u00a0Burkhard Meissner, University of the Federal Armed Forces, Hamburg, Germany.<\/a><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Download Area<\/strong><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/betautf8.1.zip\">betautf8.1<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/betautf8.zip\">betautf8<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/compose.zip\">compose<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/convconc.zip\">convconc<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/dos_csno.zip\">dos_csno<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/dos_sav.zip\">dos_sav<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/dos32exe.zip\">dos32exe<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/duke-sav-2.zip\">duke sav (2)<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/duke.zip\">duke<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/ex3-ps-2.zip\">ex3 ps (2)<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/ex3.zip\">ex3<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/freebsd.zip\">freebsd<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/linux.zip\">linuxBet<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/mac_osx.zip\">mac_osx<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/refx.zip\">refx<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/solaris.zip\">solaris<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/win32.zip\">win32BetVer<\/a><\/p>\n<p><a href=\"https:\/\/www.hsu-hh.de\/hisalt\/wp-content\/uploads\/sites\/743\/2018\/03\/win32exe.zip\">win32BetVerexe<\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>BETAUTF8 NAME betautf8 &#8211; a fast, flexible beta code to unicode (utf8) file converter &nbsp; All Programms and Data mentioned belowed can be find in the Download Area at the [&hellip;]<\/p>\n","protected":false},"author":185,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"categories":[],"tags":[],"class_list":["post-339","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/pages\/339","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/users\/185"}],"replies":[{"embeddable":true,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/comments?post=339"}],"version-history":[{"count":16,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/pages\/339\/revisions"}],"predecessor-version":[{"id":519,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/pages\/339\/revisions\/519"}],"wp:attachment":[{"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/media?parent=339"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/categories?post=339"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.hsu-hh.de\/hisalt\/wp-json\/wp\/v2\/tags?post=339"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}