There are these arrows in Unicode ⬅ ⬆ ⬇ ⬈ ⬉ ⬊ ⬋ ⬌ ⬍
But it's missing a right one. The name should be something like RIGHTWARDS BLACK ARROW, but there's no Unicode character of that name.
There are some char that seems similar, but i couldn't really find the right match. I'm looking for the right-pointing char of this set. (based on char name or semantic of the char, not font appearance)
Anyone? I need the Unicode code point.
Here's some of the char's code point
character: ⬅ (11013, #o25405, #x2b05)
character: ⬆ (11014, #o25406, #x2b06)
character: ⬍ (11021, #o25415, #x2b0d)
From the Unicode 7.0 Character Code Chart
for Miscellaneous Symbols and Arrows:
And from the Dingbats chart:
This issue was
discussed on the
Unicode Mail List, and
Jukka Korpela (author of
Unicode Explained)
mentioned:
My guess is that U+27A1 was included along with other dingbat arrows
(which are mostly rightward-pointing arrowlike symbols)
because it had been included in legacy character codes.
It was then regarded as unnecessary to duplicate it in the
Miscellaneous Symbols and Arrows block.
This is somewhat unfortunate.
Here's the timeline of when these characters were added to Unicode:
1991
The Dingbats block (2700 to 27BF) has been present since Unicode 1.0. Its characters were copied from the
ITC Zapf Dingbats font (released in 1978).
2003
The Miscellaneous Symbols and Arrows
block was created in Unicode 4.0.
2014
The RIGHTWARDS BLACK ARROW character (2B95) was
added in Unicode 7.0.
➡ http://www.fileformat.info/info/unicode/char/27a1/index.htm : ➡
None of those are a thick right arrow that the OP is looking for. If you go to the wikipedia page for arrows and scroll down to Miscellaneous Symbols and Arrows, you will see that the OP is exactly right. There are thick arrows, but no right-hand arrow.
Related
EDIT
While posting the question, character I ask for was shown well to me, but after postig it does not show up anymore. As it does not appear, please look up in original site
EDIT2
I looked for Unicode chars associated with "alien", and found no matching ones. Here is how they are compared side by side:
I found, that some texts inside my database contain character like . I am not sure, how it would rendered with different fonts and environments, so here is the image, how I see it:
I tried to identify it with different ways. For example, when I paste it into Sublime Text, it automatically shows as control character <0x85>. When I tried to identify it in different unicode-detectors (http://www.babelstone.co.uk/Unicode/whatisit.html, https://unicode-table.com/en/, https://unicode-search.net/unicode-namesearch.pl), their conclusion is pretty match the same:
Unicode code point character U+0085
UTF-8 encoding c2 85 hexadecimal
194 133 decimal
0302 0205 octal
Unicode character name <control>
Unicode 1.0 character name (deprecated) NEXT LINE (NEL)
https://unicode-search.net/unicode-namesearch.pl
also included this information
HTML encoding …
hexadecimal
…
decimal
which gave me some vague hint, how it was possible, that … become ``. But this is not main problem here.
My question is: how is possible, that control character is shown up like this and what is the actual glyph used to represent it?
I tried to sketch into http://shapecatcher.com/ to identify it but without success. I did not find such a glyph in any Unicode table.
The alien symbol is not a Unicode character; but is in Microsoft's Webdings font, with character code 0x85. Running Start > Run > charmap, then selecting Webdings from the Font drop list, opens this window:
If I click that alien character in the leftmost column, the message Character Code : 0x85 is shown at the bottom of the window.
I can even copy that character from the Character Map and paste it into Microsoft Wordpad:
The WebDings symbols were included in Unicode Release 7: Pictographic symbols (including many emoji), geometric symbols, arrows, and ornaments originating from the Wingdings and Webdings sets. Therefore you would expect the alien symbol to also be in Unicode. However, I don't think the version of Webdings that was used included that alien symbol, since Windows 10 also has a ttf file for Webdings (version 5.01), and it also does not include the alien symbol:
So presumably what originally caught your attention was some text being rendered with an older version of the Webdings font which included that alien symbol.
The glyph is 👽 U+1F47D EXTRATERRESTRIAL ALIEN. I don't know why your system misrenders a control character.
As far as I knew, there are currently two emojis for eyes. The pair of eyes (U+1F440) with hex code f09f9180 (👀), and a single eye (U+1F441) with hex code f09f9181 (👁).
I now found when using the emojis of the keyboard in my phone that another eye emoji exists, with hex code f09f9181efb88f (👁️).
The gajim messenger on the PC, and the Conversations app on the mobile phone, can display both. The gajim emoji-chooser only contains the short sequence and the Swiftkey-Keyboard Emoji-Chooser only the longer one.
When I copy and paste the emojis i.e. in the Firefox URL address bar, they look the same (blue eye, while the messengers both display them in black). When I Google for the emojis, I only find pages describing the shorter code point.
Firefox renders both emojis the same, but Vivaldi (Chromium based) shows the one with the shorter code point as narrow black and white emoji and the other one as larger brown eye.
When I Google for the hex dump, I find a lot of emojipedia sites for the shorter dump, and nothing useful at all for the longer one.
Is there somewhere any documentation about the additional emoji? Why aren't both emojis available in both emoji choosers?
f0 9f 91 80 is the UTF-8 encoded form of codepoint U+1F440.
f0 9f 91 81 is the UTF-8 encoded form of codepoint U+1F441.
f0 9f 91 81 ef b8 8f is the UTF-8 encoded form of codepoints U+1F441 U+FE0F.
U+FE0F is a Variation Selector:
Variation Selectors is a Unicode block containing 16 Variation Selector format characters (designated VS1 through VS16). They are used to specify a specific glyph variant for a Unicode character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS15 and VS16 have been defined.
Where U+FE0F is VARIATION SELECTOR-16:
U+FE0F was added to Unicode in version 3.2 (2002). It belongs to the block Variation Selectors in the Basic Multilingual Plane.
This character is a Nonspacing Mark and inherits its script property from the preceding character.
The glyph is not a composition. It has a Ambiguous East Asian Width. In bidirectional context it acts as Nonspacing Mark and is not mirrored. In text U+FE0F behaves as Combining Mark regarding line breaks. It has type Extend for sentence and Extend for word breaks. The Grapheme Cluster Break is Extend.
This codepoint may change the appearance of the preceding character. If that is a symbol, dingbat or emoji, U+FE0F forces it to be rendered as a colorful image as compared to a monochrome text variant. The Unicode standard defines some standardized variants. See also “Unicode symbol as text or emoji” for a discussion of this codepoint.
In other words, U+FE0F tells VS-aware software to render U+1F441 as a colorful emoji instead of as monochromatic text.
The singular ‘👁’ is used as an emoji, but is defined as being text-style (i.e. black-and-white rather than colourful) by default. This isn’t implemented consistently across all platforms, however, so sometimes the character will also display as emoji style instead. In order to explicitly force one or the other style, the characters U+FE0E and U+FE0F can be appended to 👁 to make it appear as text style (👁︎) or emoji style (👁️) respectively. Because of the inconsistencies I mentioned, some devices and applications automatically add U+FE0F to the character (resulting in the longer code your phone keyboard produced), while others leave the character as-is (leaving just the code for the eye itself).
I'm having trouble breaking a word into its individual unicode components. I'm working with the devanagari script using google input tools. An example is र्म (pronounced -rm), which I want to break into म (-m) and the that hook at the top (-r). But I can't seem to find the unicode character that corresponds to the hook at the top. Here's some of the solutions I tried
1. copy and past र्म into MS word and hit alt x. But this breaks the word into र् and म. It doesn't give me the unicode character for the top hook
2. I tried the site http://shapecatcher.com/. I found a character called latin egyptological ain; while similar in shape, it cannot be used on top of another character. I'm looking the conjunct version of the hook.
Any help would be appreciated. I'm using TekMaker on Windows 8.
The ‘hook at the top’ representing a preceding र् is an inseparable part of the glyph for a variety of biconsonantal ligatures. It's not a discrete, freely-combinable diacritical mark as we would understand it in Latin-like scripts.
Consequently the visual rendering element doesn't have its own Unicode representation distinct from its linguistic meaning र्, sorry!
Is there any way I can write a superscript slash with Unicode?
My aim is to represent rational exponents in a nicer form than 123**(456/789).
Well unicode is full of characters. The meaning is up to its interpretation.
For superscript slash you can use:
Canadian Syllabics Final Acute 123⁴⁵⁶ᐟ⁷⁸⁹
Right Raised Omission Bracket 123⁴⁵⁶⸍⁷⁸⁹
Musical Symbol Repeated Figure-1 123⁴⁵⁶𝄍⁷⁸⁹
For subscript slash you can use:
Right Low Paraphrase Bracket 123₄₅₆⸝₇₈₉
If you have other solutions please comment and I will update my answer.
A helpful site to find special unicode characters: shapecatcher
No. On general grounds, we can be pretty sure that if such a character existed, it would be in the Superscripts and Subscripts block (not all superscripts are there, but the odds are that if any superscripts will be added, they will be placed there).
So you need some higher-level protocol, as you usually do, when you need superscripts beyond a fairly limited repertoire. Unicode is about encoding characters, not about layout and mathematical expressions.
Assuming http://en.wikipedia.org/wiki/Unicode_subscripts_and_superscripts is accurate, the answer is no.
Looking at the complete official Unicode name list and making the bold assumption it would have "slash" either in its name or description, there is no such character at this time.
I'm trying to display special unicode characters, in particular the mathematical operator 𝓮 in emacs. Specifically:
position: 283 of 317 (89%), column: 0
character: 𝓮 (displayed as 𝓮) (codepoint 120046, #o352356, #x1d4ee)
preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x1D4EE
syntax: w which means: word
category: .:Base, L:Left-to-right (strong)
buffer code: #xF0 #x9D #x93 #xAE
file code: #xF0 #x9D #x93 #xAE
(encoded by coding system utf-8-unix)
display: no font available
Unicode data:
Name: MATHEMATICAL BOLD SCRIPT SMALL E
Category: Letter, Lowercase
Combining class: Ll
Bidi category: Ll
Decomposition: font e
Character code properties: customize what to show
name: MATHEMATICAL BOLD SCRIPT SMALL E
general-category: Ll (Letter, Lowercase)
decomposition: (font 101) (font 'e')
There are text properties here:
fontified t
I'm using GNU Emacs 24 a recent nightly binary. The text above displays fine on my browser and in TextEdit however, the special characters come up empty when viewed in emacs.
I read this from an old Emacs 22 manual: "A fontset does not necessarily specify a font for all character codes. If a fontset specifies no font for a certain character, or if it specifies a font that does not exist on your system, then it cannot display that character. It will display an empty box instead." - This is the exact behavior I am observing
It seems I may need to build a fontset to be able to display such arbitrary characters (starting with the Xdefaults or Xresources files).
How can I identify which font families I will need to include in the fontset to display Math operators (most online examples refer to languages like Latin, Chinese, etc.)? I couldn't even find any examples of .Xdefault or .Xresource files.
Am I on the right track? Is there an easier/more obvious way to do this?
I have the same problem, and I don't have a general solution either. Here's my approach to fixing a single character (or potentially a range),
assuming that you have the character in a buffer and it's not displaying.
Some experimentation showed that Menlo is a useful source of characters, like FreeSerif.
Put the cursor before the non-displayed character.
m-x describe-char. This gives you a lot of information about the character, including a line of the form "code point in charset: 0x2055".
Somewhere in your .emacs or related files, use this function. It can potentially fix a whole range of characters by snagging them from the FreeSerif family or something else, but I don't have good choices for anything but a few characters.
(defun bbextra-fix-fontset-font (from &optional to family)
"Make characters FROM to TO come from FAMILY.
Default value of TO is FROM, and of FAMILY is FreeSerif (which
seems to have some of the characters)"
(set-fontset-font t (cons from (or to from))
(font-spec :family (or family "FreeSerif"))))
;; Here are the characters I have fixed.
(bbextra-fix-fontset-font #x2042)
(bbextra-fix-fontset-font #x2023)
(bbextra-fix-fontset-font #x203D)
(bbextra-fix-fontset-font #x2055)
;These come from Menlo
(bbextra-fix-fontset-font #x2620 #x2620 "Menlo") ; skull and crossbones
(bbextra-fix-fontset-font #x266C #x266C "Menlo") ; 16th notes
(bbextra-fix-fontset-font #x2695 #x2695 "Menlo") ; asclepion
(bbextra-fix-fontset-font #x2624 #x2624 "Menlo") ; caduceus
The function set-fontset-font may be used to specify which font to use for any range of characters; e.g.,
(set-fontset-font t '(#x1d4ee . #x1d4ee) (font-spec :family "FreeSerif"))
There was a known bug with MacOS emacs and displaying characters beyond the BMP. See for example my bug report at Emacs bugs.
After reporting this bug, I had an e-mail suggesting use of the “Mac port” version of emacs. This apparently displays non-BMP characters.
The bug was fixed subsequently, in 24.4 and beyond.