toad.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server operated by David Troy, a tech pioneer and investigative journalist addressing threats to democracy. Thoughtful participation and discussion welcome.

#unicode

Shufei 🧮<p><a href="https://meta.wikimedia.org/w/index.php?search=Vertical+writing&amp;title=Special%3ASearch&amp;ns0=1&amp;ns12=1&amp;ns200=1&amp;ns202=1&amp;searchToken=vtm1d09u2iyi9e668pfvpe5u" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">meta.wikimedia.org/w/index.php</span><span class="invisible">?search=Vertical+writing&amp;title=Special%3ASearch&amp;ns0=1&amp;ns12=1&amp;ns200=1&amp;ns202=1&amp;searchToken=vtm1d09u2iyi9e668pfvpe5u</span></a></p><p>It’s 2025 and:<br>- There is still no vertical text site mode for <a href="https://merveilles.town/tags/Wikimedia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Wikimedia</span></a> in any language using vertical text.<br>- <a href="https://merveilles.town/tags/Wikipedia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Wikipedia</span></a> still forces “simplified” Chinese on browsers.<br>- There is still no true IDS or CangJie composition matrix for characters in <a href="https://merveilles.town/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a>. <br>- SignWriting still has no proper <a href="https://merveilles.town/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> inclusion, no IDS analogue, no inventory of signs, and is still mostly written by mouse drag and drop in a mishmash of SVG and HTML.<br>- There is no proper SignWriting IME, such as a Rime schema.</p><p>To say this state of affairs is cultural propaganda by mass technic inertia would be an understatement. Infotech is functional colonialism. That’s really all there is to say.</p><p>Filed under <a href="https://merveilles.town/tags/%E5%B4%87%E6%B4%8B%E5%AA%9A%E5%A4%96" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>崇洋媚外</span></a></p>
Martin Maciaszek :commodore:<p>Updated my unilookup utility. It now accepts Unicode strings on stdin as well as a command line parameter. Can be installed directly from PyPI. <a href="https://github.com/fastjack/unilookup" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/fastjack/unilookup</span><span class="invisible"></span></a><br><a href="https://maciaszek.social/tags/unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unicode</span></a> <a href="https://maciaszek.social/tags/python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>python</span></a></p>
screwlisp<p>Real talk.</p><p>What are some other plant/insect/bird <a href="https://gamerplus.org/tags/unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unicode</span></a> characters? I thought of</p><p>🪴🌳🎄🌲🌴⸙🍃🥬🥕🥦🍁🍀🍂🍉🍏🍎🍍🌵🌿🥗🥒🥔🍈🍌🎋🍄🪵🥝🫐🫒🍠🍓🥥🍋🍇</p><p>🐛🪱🦋🪲🐝🪰🐞🐜🦟🕷🦀🦞🪳🐌🐚🦂🕸</p><p>🐥🐦🦅🐔🐓🦆🦢🐤🐣🦃🕊🐧🦉</p>
Dr. bar. met. Paul B.<p><span class="h-card" translate="no"><a href="https://mastodon.gamedev.place/@lritter" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>lritter</span></a></span> <br>🯁🯂🯃 That's why I ❤️ <a href="https://karlsruhe-social.de/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> 😄</p>
Adële 🌹<p><strong>Unicode characters for Creative Commons symbols</strong></p><p>I've just discovered that there are symbols since Unicode 13.0 for CC licences</p><ul><li><p>CC: 🅭</p></li><li><p>BY: 🅯</p></li><li><p>NC: 🄏</p></li><li><p>ND: ⊜</p></li><li><p>SA: 🄎</p></li><li><p>PD: 🅮</p></li><li><p>CC0: 🄍</p></li></ul><p><a href="https://en.wikipedia.org/wiki/Creative_Commons_license#Unicode_symbols" rel="nofollow noopener" target="_blank">source</a></p><p><a href="https://social.pollux.casa/tags/creativecommons" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>creativecommons</span></a> <a href="https://social.pollux.casa/tags/emoji" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>emoji</span></a> <a href="https://social.pollux.casa/tags/unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unicode</span></a></p>
Veronica Olsen 🏳️‍🌈🇳🇴🌻<p>Got a bug report for <span class="h-card" translate="no"><a href="https://fosstodon.org/@novelwriter" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>novelwriter</span></a></span> from someone who uses Cuneiform text in their work. These are 4 byte Unicode symbols, and turned out to be very tricky to handle. 😅</p><p>The app is built with Python, which will switch a string to UCS-4 when it contains such characters, so the characters always have a single index in the string.</p><p>However, the Qt library uses UTF-16. That means 4-byte characters use two slots, creating a mismatch in indices between the two representations.</p><p><a href="https://mastodon.online/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://mastodon.online/tags/Qt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Qt</span></a> <a href="https://mastodon.online/tags/Code" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Code</span></a> <a href="https://mastodon.online/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a></p>
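The index mismatch described in the post above can be illustrated with a small sketch. This is not novelWriter's actual fix; the helper names `py_to_utf16_index` and `utf16_to_py_index` are hypothetical. It assumes only that Python indexes strings by code point while a UTF-16 API counts two code units for every character above U+FFFF.

```python
def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units needed for one code point."""
    return 2 if ord(ch) > 0xFFFF else 1


def py_to_utf16_index(text: str, index: int) -> int:
    """Convert a Python (code point) index into a UTF-16 code unit offset."""
    return sum(utf16_units(ch) for ch in text[:index])


def utf16_to_py_index(text: str, utf16_index: int) -> int:
    """Convert a UTF-16 code unit offset into a Python (code point) index.

    An offset that falls inside a surrogate pair is rounded up to the
    position just after that character.
    """
    units = 0
    for i, ch in enumerate(text):
        if units >= utf16_index:
            return i
        units += utf16_units(ch)
    return len(text)


# U+12000 is a cuneiform sign, outside the Basic Multilingual Plane.
sample = "a\U00012000b"
print(len(sample))                    # code points as seen by Python
print(py_to_utf16_index(sample, 2))   # offset of "b" in UTF-16 code units
print(utf16_to_py_index(sample, 3))   # back to the Python index of "b"
```

Converting at the boundary between the two libraries, rather than storing both kinds of index, keeps a single source of truth for positions.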
Flominator<p>Fascinating: Two feeds for <span class="h-card" translate="no"><a href="https://freiburg.social/@hinterzarten_news" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>hinterzarten_news</span></a></span> couldn't be properly pasted from the website anymore, because they changed the dates from having &amp;#8202; ("Hair Space") between the dots and the numbers, into &amp;#8203; ("Zero Width Space"). Shout-out to the creator of <a href="https://www.mauvecloud.net/charsets/CharCodeFinder.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">mauvecloud.net/charsets/CharCo</span><span class="invisible">deFinder.html</span></a>, which is a really helpful tool for finding out exactly which <a href="https://genealysis.social/tags/character" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>character</span></a> you have in front of you.</p><p><a href="https://genealysis.social/tags/webdev" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>webdev</span></a> <a href="https://genealysis.social/tags/unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unicode</span></a> <a href="https://genealysis.social/tags/html" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>html</span></a></p>
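Invisible characters like the two mentioned above can also be identified locally. The following is a minimal sketch using Python's standard `unicodedata` module; the function name `describe` is made up for illustration and has nothing to do with the linked tool.

```python
import unicodedata


def describe(text: str) -> list[tuple[str, str]]:
    """Return (code point, Unicode name) for every character in text."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


# &#8202; is U+200A and &#8203; is U+200B.
for code_point, name in describe("1.\u200a2.\u200b3"):
    print(code_point, name)
```

Pasting a suspicious date string into `describe` shows at a glance whether the separator is a visible-width space or a zero-width one.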
Head·word /ˈhedˌwɜː(ɹ)d/ n.<p><span class="h-card" translate="no"><a href="https://typo.social/@UnicodeWatch" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>UnicodeWatch</span></a></span> </p><p>Interesting to see letters like :Dania_LongI:, :Phonotypic_ith:, and :Phonotypic_oi: proposed for inclusion in Unicode! :Unicode: </p><p><a href="https://lingo.lol/tags/EnglishPhonotypicAlphabet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EnglishPhonotypicAlphabet</span></a> <a href="https://lingo.lol/tags/PhonotypicAlphabet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PhonotypicAlphabet</span></a> <a href="https://lingo.lol/tags/Phonotypic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Phonotypic</span></a> <a href="https://lingo.lol/tags/Dania" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Dania</span></a> <a href="https://lingo.lol/tags/Phonetic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Phonetic</span></a> <a href="https://lingo.lol/tags/Phonetics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Phonetics</span></a> <a href="https://lingo.lol/tags/PhoneticTranscription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PhoneticTranscription</span></a> <a href="https://lingo.lol/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a></p>
Greg<p>Did you know that new <a href="https://icosahedron.website/tags/Emoji" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Emoji</span></a> can be proposed by anyone, simply by following some guidelines laid out by the <a href="https://icosahedron.website/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> consortium? There's a time window each year where they accept proposals, and a select few might make it into future sets.</p><p>This year I turned one in: "Circuit Board", which I was surprised to find 1. didn't exist and 2. had not been proposed before (though CPU and Microchip have both been submitted and declined in the last 5 years)</p><p>You can read my proposal here:<br><a href="https://storage.googleapis.com/greg-kennedy.com/Proposal%20for%20Emoji%20%E2%80%9CCircuit%20Board%E2%80%9D.pdf" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">storage.googleapis.com/greg-ke</span><span class="invisible">nnedy.com/Proposal%20for%20Emoji%20%E2%80%9CCircuit%20Board%E2%80%9D.pdf</span></a></p><p>and you can see the Unicode emoji proposal guidelines here:<br><a href="https://www.unicode.org/emoji/proposals.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">unicode.org/emoji/proposals.ht</span><span class="invisible">ml</span></a></p><p>Anyway, the odds aren't great of getting accepted, but if it IS then you can say "hey! I know the guy who submitted that one!"</p><p>Attached are the sample images I drew up for the proposal - which, incidentally, are now Public Domain as well. Enjoy!</p>
Doktor Overcomma :vepi:<p>Very cool, copy-paste UTF text from, e.g., Wikipedia, get Unicode.<br>Sanskrit अश्विन्<br>can be in your HTML as<br>&amp;#x0905;&amp;#x0936;&amp;#x093f;&amp;#x0935;&amp;#x0928;&amp;#x094d;<br><a href="https://r12a.github.io/app-conversion/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">r12a.github.io/app-conversion/</span><span class="invisible"></span></a><br><a href="https://dobbs.town/tags/UTF" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>UTF</span></a> <a href="https://dobbs.town/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> <a href="https://dobbs.town/tags/conversion" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>conversion</span></a></p>
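The conversion from text to hexadecimal numeric character references can be sketched in a few lines of Python. This is an illustration only, not how the linked converter works; `to_hex_ncr` is a hypothetical helper name.

```python
import html


def to_hex_ncr(text: str) -> str:
    """Encode every character as an HTML hexadecimal character reference."""
    return "".join(f"&#x{ord(ch):04X};" for ch in text)


# Two Devanagari letters written as escapes so the input is unambiguous.
encoded = to_hex_ncr("\u0905\u0936")
print(encoded)

# html.unescape reverses the encoding, which makes a handy round-trip check.
print(html.unescape(encoded) == "\u0905\u0936")
```

Applying `to_hex_ncr` to a word copied from a web page shows the exact code point sequence, including combining marks that are hard to see in rendered text.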
SnoopJ<p>TIL that in <a href="https://hachyderm.io/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a>, U+23BE through U+23CC are a series of symbols dedicated to the notation (?!) of dentistry</p>
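The range is easy to inspect with Python's `unicodedata` module. A minimal sketch follows; the function name `dentistry_symbols` is made up for this example.

```python
import unicodedata


def dentistry_symbols() -> list[tuple[str, str]]:
    """List (code point, name) for U+23BE through U+23CC inclusive."""
    return [
        (f"U+{cp:04X}", unicodedata.name(chr(cp)))
        for cp in range(0x23BE, 0x23CC + 1)
    ]


for code_point, name in dentistry_symbols():
    print(code_point, name)
```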
Eana Hufwe<p><a href="https://s.1a23.studio/tags/TIL" rel="nofollow noopener" target="_blank">#TIL</a><span> </span><a href="https://s.1a23.studio/tags/Unicode" rel="nofollow noopener" target="_blank">#Unicode</a><span> has plans to encode ancient scripts such as small seal script, oracle bone script, and bronze inscriptions in Plane 3. </span><a href="https://www.unicode.org/roadmaps/tip/" rel="nofollow noopener" target="_blank">https://www.unicode.org/roadmaps/tip/</a></p>
RevK :verified_r:<p>Ah, <a href="https://toot.me.uk/tags/unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>unicode</span></a>...</p>
Aaron “#e14n pro” Madlon-Kay<p>Android 16 (SDK 36) is now out and it has added support for the following emoji:</p><p>🪉🪏🪾🫆🫜🫟🫩</p><p>(This is the only change in code point coverage versus SDK 35 that I could detect.)</p><p>I've updated Is It Tofu? with the new data:<br><a href="https://tofu.quest/?q=%F0%9F%AA%89%F0%9F%AA%8F%F0%9F%AA%BE%F0%9F%AB%86%F0%9F%AB%9C%F0%9F%AB%9F%F0%9F%AB%A9" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">tofu.quest/?q=%F0%9F%AA%89%F0%</span><span class="invisible">9F%AA%8F%F0%9F%AA%BE%F0%9F%AB%86%F0%9F%AB%9C%F0%9F%AB%9F%F0%9F%AB%A9</span></a></p><p><a href="https://mastodon.social/tags/Android" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Android</span></a> <a href="https://mastodon.social/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> <a href="https://mastodon.social/tags/IsItTofu" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IsItTofu</span></a></p>
`Da Elf<p><span class="h-card" translate="no"><a href="https://mastodon.social/@doctorwhom" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>doctorwhom</span></a></span> Ha! At The Mag, remember Ops got this filtering software and we decided to see if we could break it? Tox added an "@" before the &lt;html&gt; tag. </p><p>Browser totally rendered the page but filter wouldn't parse it because of the At 🤣🤣🤣</p><p>I think it took us three hours?</p><p><a href="https://mstdn.social/tags/porn" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>porn</span></a> <a href="https://mstdn.social/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a></p>
Sven<p>Downloading as many HTML pages as you can find that have <a href="https://mastodon.social/tags/porn" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>porn</span></a>. Converting all those texts to <a href="https://mastodon.social/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a>. Doing text comparisons to see if there is something similar between those pages. This does not help you find/filter porn.</p><p>Somebody paid me to try this, it didn't work.</p>
Sven<p>Is there a <a href="https://mastodon.social/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> way to represent 100 in a single fullwidth character?</p>
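One way to explore this question is to scan the Unicode Character Database for every character whose numeric value is 100 and print its East Asian Width class alongside it. The sketch below assumes Python's bundled `unicodedata` tables, so results depend on the Unicode version shipped with the interpreter; `chars_with_numeric_value` is a hypothetical helper name.

```python
import sys
import unicodedata


def chars_with_numeric_value(value: float) -> list[tuple[str, str, str]]:
    """Find all characters whose Unicode numeric value equals value.

    Returns (code point, East Asian Width class, name) tuples.
    Width class "F" means fullwidth and "W" means wide.
    """
    matches = []
    for cp in range(sys.maxunicode + 1):
        ch = chr(cp)
        # The second argument is returned for characters with no numeric value.
        if unicodedata.numeric(ch, None) == value:
            matches.append((
                f"U+{cp:04X}",
                unicodedata.east_asian_width(ch),
                unicodedata.name(ch, "<unnamed>"),
            ))
    return matches


for code_point, width, name in chars_with_numeric_value(100):
    print(code_point, width, name)
```

Filtering the output on the width column narrows the candidates to those that occupy a full cell in East Asian typography.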
Inautilo<p><a href="https://mastodon.social/tags/Development" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Development</span></a> <a href="https://mastodon.social/tags/Fun" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Fun</span></a><br>Decorative text within HTML · It’s perfectly valid HTML, but you may not like it <a href="https://ilo.im/164c2o" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">ilo.im/164c2o</span><span class="invisible"></span></a></p><p>_____<br><a href="https://mastodon.social/tags/HtmlAttributes" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HtmlAttributes</span></a> <a href="https://mastodon.social/tags/Comments" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Comments</span></a> <a href="https://mastodon.social/tags/Emoji" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Emoji</span></a> <a href="https://mastodon.social/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> <a href="https://mastodon.social/tags/AsciiArt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AsciiArt</span></a> <a href="https://mastodon.social/tags/WebDev" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebDev</span></a> <a href="https://mastodon.social/tags/Frontend" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Frontend</span></a> <a href="https://mastodon.social/tags/HTML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HTML</span></a></p>
SnoopJ<p><span class="h-card" translate="no"><a href="https://mastodon.sdf.org/@argv_minus_one" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>argv_minus_one</span></a></span> there are a surprising number of moving pieces here, but as with all things <a href="https://hachyderm.io/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> a lot of it boils down to "the UCD stores relevant properties and software interprets those"</p><p>I don't know the entire story, but my touchpoint for this is CJK numerals like 一, 二, 三</p><p>Those cannot be converted as decimals or digits, but they *do* have a numeric value</p><p>In Python:</p><p>```<br>&gt;&gt;&gt; import unicodedata<br>&gt;&gt;&gt; unicodedata.decimal("三")<br>...<br>ValueError: not a decimal<br>&gt;&gt;&gt; unicodedata.digit("三")<br>...<br>ValueError: not a digit<br>&gt;&gt;&gt; unicodedata.numeric("三")<br>3.0<br>```</p><p>For CJK specifically this is probably related to the fact that these are often combined with other glyphs multiplicatively, and there are plenty of non-decimal glyphs.</p>
argv minus one<p>Fun fact: <a href="https://mastodon.sdf.org/tags/Unicode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Unicode</span></a> recognizes 38 different ways to write the digits zero through nine.</p><p>You can translate them into ASCII digits, and thus parse a decimal number written in any Unicode-recognized script, by subtracting the code point of that script's zero from the digit's code point, then adding the code point for ASCII zero.</p><p>This won't work for number systems that aren't decimal place-value, though, like Roman or Babylonian cuneiform. I don't even want to know how to parse those.</p><p><a href="https://mastodon.sdf.org/tags/programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>programming</span></a> <a href="https://mastodon.sdf.org/tags/i18n" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>i18n</span></a></p>
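The digit translation described above can be sketched in Python. This version assumes that `unicodedata.decimal` is an acceptable way to find each script's zero: a digit's code point minus its decimal value gives the code point of zero in the same run of digits. The function name `to_ascii_digits` is hypothetical.

```python
import unicodedata


def to_ascii_digits(text: str) -> str:
    """Translate decimal digits from any script into ASCII digits.

    Raises ValueError if a character is not a decimal digit, for example
    a Roman numeral or a CJK numeral such as U+4E09.
    """
    out = []
    for ch in text:
        value = unicodedata.decimal(ch)   # 0..9, or ValueError
        script_zero = ord(ch) - value     # code point of this script's zero
        # Digit minus the script's zero, plus ASCII zero.
        out.append(chr(ord(ch) - script_zero + ord("0")))
    return "".join(out)


# Arabic-Indic digits one, two, three, then Devanagari four and two.
print(int(to_ascii_digits("\u0661\u0662\u0663")))
print(int(to_ascii_digits("\u096a\u0968")))
```

Python's built-in `int()` already accepts decimal digits from other scripts, so the explicit translation is mainly useful when the ASCII string itself is needed, for example before handing it to a parser that only understands ASCII.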