Unicode for latex symbol \varoiint, I got the unicode 222F but it's not correct
unicode 222F
My requirement is
Related
One of my references in Bibdesk contains some latin/Greek character e.g. 'β'. I am getting the error while using the reference in TEXMAKER:
"! Package inputenc Error: Unicode char \u8:β not set up for use with LaTeX."
How can I set it up to work?
Though with inputenc TeX can read all the unicode characters, it doesn't know what to do with most of them, except those in the usual ascii range. I once also had a problem with that, when I wanted to copy some unicode text verbatim into one of my TeX documents, and that text contained symbols like alpha, or other math symbols.
The solution to that is the command \DeclareUnicodeCharacter{#1}{#2} where in #1 you have to put the unicode value of the character and in #2 a tex expression, that gets inserted, when character code #1 is encountered. E.g. for the beta you could use \DeclareUnicodeCharacter{03B2}{\ensuremath{\beta}}, because 03B2 is the unicode character value for the symbol "beta" (you have to look those things up in a Unicode table).
I've also written a tex package for that, if you're interested. It can be found on github at https://github.com/ezander/utf8math. See especially this file here: https://github.com/ezander/utf8math/blob/master/utf8math.sty
Another solution is to use XeTeX, which is more suitable than most other TeX engines for unicode : replace the lines
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}
with
\usepackage{fontspec}
As the title says:
Which encoding do I have to choose in emacs long list of encodings (accessible via e.g., change for saving file: C-x RET f), that it matches ansinew in LaTeX as given by:
\usepackage[ansinew]{inputenc}
In inputenc manual you can read
ansinew Windows 3.1 ANSI encoding, extension of Latin-1 (synonym for
cp1252).
In Emacs cp1252 is an alias of windows-1252, so you can use both of them.
I'm encoding Unicode Emoji characters into a string and loading that into a webview..
for the encoding I'm using "NSUTF8StringEncoding" But the emoji characters just show up as jiberish unicode symbols like ∆¬ÓÔÓ˝ˆ So what encoding do I need?
NSUTF32StringEncoding did the trick, didn't see it on the list of encoding before.
When I read the documentation about set-fontset-font, I found there is a sentence like below:
(set-fontset-font NAME TARGET FONT-SPEC &optional FRAME ADD)
Modify fontset NAME to use FONT-SPEC for TARGET characters.
...
TARGET may be a script name symbol. In that case, use FONT-SPEC for all characters that belong to the script.
TARGET may be a charset. In that case, use FONT-SPEC for all characters in the charset.
...
But I cannot find any more information about what the "script name symbol" is?
I search via google to find some example that uses 'kan 'ascii 'kana. But I don't find any detail information about what it exactly is? Is there a way to list all the possible value?
BTW,
Is there a way to list all the possible value of "charset" that is available to set-fontset-font?
A "script name symbol" is a means to associate a lisp symbol with a particular charset, a set of charsets, or a range of characters. So, for example, if you look at the value of the variable "charset-script-alist", you will see an Alist of charsets vs the corresponding most appropriate script name symbols. If you look at the value of the variable "script-representative-chars", you will see an Alist of script name symbols vs the representative characters. The variable "char-script-table" shows the full mapping of characters to script name symbols (as specified by "script-representative-chars"). I don't know of any "definitive" list of script names (for example, "ascii" is a script name as well but isn't contained in these 2 variables); however, the values that are contained in the variables "charset-script-alist" and "char-script-table" are probably most of them.
To get a list of all the possible values of "charset" that is available to set-fontset-font, use the "list-character-sets" function. You can also use the "list-charset-chars" function to see the list of characters contained in a specific charset.
M-:(char-table-extra-slot char-script-table 0) gives
'(latin phonetic greek coptic cyrillic armenian hebrew arabic
syriac nko thaana devanagari bengali gurmukhi gujarati oriya
tamil telugu kannada malayalam sinhala thai lao tibetan burmese
georgian hangul ethiopic cherokee canadian-aboriginal ogham runic
khmer mongolian symbol braille han ideographic-description
cjk-misc kana bopomofo kanbun yi cham tai-viet linear-b
aegean-number ancient-greek-number ancient-symbol phaistos-disc
lycian carian olt-italic ugaritic old-persian deseret shavian
osmanya cypriot-syllabary phoenician lydian kharoshthi cuneiform
cuneiform-numbers-and-punctuation byzantine-musical-symbol
musical-symbol ancient-greek-musical-notation
tai-xuan-jing-symbol counting-rod-numeral mathematical
mahjong-tile domino-tile)
For character sets, try M-xlist-character-sets.
So there is "�", the replacement character 0xFFFD, the symbol for a byte-sequence that is not represented as a character in Unicode (right?).
Well, I wonder what 'is' this 'thing' actually, as I can't 'see'/'find' it in Windows' character table, neither searching for the symbol itself, nor searching for FFFD. But after all it is a character, right? So it should be in there. I am confused ...
The font Arial does probably not contain a corresponding glyph to represent that character. Try a different font like Arial Unicode or Arial Unicode MS.