Unicode: Difference between revisions

(→‎Unicode symbols: unicode / latex name for arrows)
No edit summary
 
(12 intermediate revisions by the same user not shown)
Line 1: Line 1:
== References ==
* https://tonsky.me/blog/unicode/ — The Absolute Minimum Every Software Developer Must Know About Unicode in 2023 (Still No Excuses!)

== Unicode symbols ==
== Unicode symbols ==
See [http://en.wikipedia.org/wiki/List_of_logic_symbols List of logic symbols on Wikipedia] for quite an extensive list

Frequently used symbols:
Frequently used symbols:


Line 6: Line 11:
!Symbol!!Unicode<br/>(hex)!!Unicode<br/>(dec)!!Keymap (linux be)!!HTML<br/>entity!!Unicode<br/>name!!LaTeX
!Symbol!!Unicode<br/>(hex)!!Unicode<br/>(dec)!!Keymap (linux be)!!HTML<br/>entity!!Unicode<br/>name!!LaTeX
|-
|-
|← ||0x2190 ||8592 || {{kbctrl|AltGr-Y}} ||&larr; <code>&amp;larr;</code> ||Leftwards Arrow ||<code>\leftarrow</code> <code>\gets</code>
|←
|0x2190
|8592
|← {{kbctrl|AltGr-Y}}
|&larr; <code>&amp;larr;</code>
|Leftwards Arrow
|<code>\leftarrow</code> <code>\gets</code>
|-
|-
|↑ ||0x2191 ||8593 || {{kbctrl|AltGr-Shift-U}} ||&uarr; <code>&amp;uarr;</code> ||Upwards Arrow ||<code>\uparrow</code>
|↑
|0x2191
|8593
|↑ {{kbctrl|AltGr-Shift-U}}
|&uarr; <code>&amp;uarr;</code>
|Upwards Arrow
|<code>\uparrow</code>
|-
|-
|→ ||0x2192 ||8594 || {{kbctrl|AltGr-I}} ||&rarr; <code>&amp;rarr;</code> ||Rightwards Arrow ||<code>\rightarrow</code> <code>\to</code>
|→
|0x2192
|8594
|→ {{kbctrl|AltGr-I}}
|&rarr; <code>&amp;rarr;</code>
|Rightwards Arrow
|<code>\rightarrow</code> <code>\to</code>
|-
|-
|↓ ||0x2193 ||8595 || {{kbctrl|AltGr-U}} ||&darr; <code>&amp;darr;</code> ||Downwards Arrow ||<code>\downarrow</code>
|↓
|0x2193
|8595
|↓ {{kbctrl|AltGr-U}}
|&darr; <code>&amp;darr;</code>
|Downwards Arrow
|<code>\downarrow</code>
|-
|-
| ⃖||0x20D6 ||8406 || || ||Combining Left Arrow Above ||
| ⃖
|0x20D6
|8406
|
|
|Combining Left Arrow Above
|
|-
|-
| ⃗||0x20D7 ||8407 || || ||Combining Right Arrow Above ||
| ⃗
|0x20D7
|8407
|
|
|Combining Right Arrow Above
|
|-
|-
| ̂||0x0302||770|| || <code>&amp;#x0302;</code> <code>&amp;#770;</code>||Combining Circumflex Accent||
|∈
|0x2208
|8712
|
|&darr; <code>&amp;darr;</code>
|Element Of
|<code>\in</code>
|-
|-
| ̄||0x0304||772|| || <code>&amp;#x0304;</code> <code>&amp;#772;</code>||Combining Macron||
|∉
|0x2209
|8713
|
|&darr; <code>&amp;darr;</code>
|Not An Element Of
|<code>\notin</code>
|-
|-
| ̅||0x0305 ||773 || || ||Combining Overline ||
|∊
|-
|0x220A
| ̲||0x0332 ||818 || || ||Combining Low Line ||
|8714
|
|-
|&darr; <code>&amp;darr;</code>
|∈ ||0x2208 ||8712 || || <code>&amp;#x2208;</code> <code>&amp;#8712;</code>||Element Of ||<code>\in</code>
|-
|Small Element Of
|<code>\smallin</code>
|∉ ||0x2209 ||8713 || || ||Not An Element Of ||<code>\notin</code>
|-
|∊ ||0x220A ||8714 || || ||Small Element Of ||<code>\smallin</code>
|-
|⊂ ||0x2282 ||8834 || || <code>&amp;#x2282;</code> <code>&amp;#8834;</code>||Subset Of ||<code>\subset</code>
|-
|⊆ ||0x2286 ||8838 || || <code>&amp;#x2286;</code> <code>&amp;#8838;</code>||Subset Of Or Equal To||<code>\subseteq</code>
|-
|· ||0x00B7 ||0183 || || ||Middle Dot ||<code>\cdotp</code> <code>\centerdot</code>
|}
|}
(LaTeX symbol name from package <tt>unicode-math</tt>)
(LaTeX symbol name from package <tt>unicode-math</tt>)
Line 87: Line 52:


Using ''Combining'' symbols:
Using ''Combining'' symbols:
* In Windows, try ''Character Map''. Use '''Search for''' with ''Combining'' to find all combining symbols.
* TBC
* Use a Unicode font for best results
* In Office, Word and Visio seem to give good results. PowerPoint sometimes misaligns the symbols.
** Tip: Type the letter, then type a space and move the cursor back, then type the combining diacritic [http://www.personal.psu.edu/ejp10/blogs/gotunicode/2010/03/dealing-with-x-bar-x-and-p-hat.html]


Reference:
Reference:
Line 101: Line 69:
* In Gnome, see ''character maps'' (<code>gucharmap</code>) to get the Unicode hex code.
* In Gnome, see ''character maps'' (<code>gucharmap</code>) to get the Unicode hex code.
* In KDE, try ''KDE Character Selector'' (<code>KCharSelect</code>).
* In KDE, try ''KDE Character Selector'' (<code>KCharSelect</code>).

;Inserting Unicode symbols in Windows
* {{kb|Alt-KP0}}-{{kb|Alt-KP9}} &mdash; Press {{kb|Alt}} key then the unicode character code in ''decimal''.
** For small codes, prefix a <code>0</code> to force Unicode instead of ASCII (e.g. {{kb|Alt-0183}} to get ''Middle Dot'')
* <code>4-digit hex code</code>, then {{kb|Alt-X}} &mdash; Type unicode character code in ''hexadecimal'', then press {{kb|Alt-X}} (at least in Office applications)

== Unicode fonts ==

Some fonts with good unicode support.


On Windows:
* Arial Unicode MS
* Cambria Math

All platforms:
* [http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&id=FontDownloads&_sc=1 sil.org] (see Gentium)

Latest revision as of 12:43, 3 October 2023

References

Unicode symbols

See List of logic symbols on Wikipedia for quite an extensive list

Frequently used symbols:

Symbol | Unicode (hex) | Unicode (dec) | Keymap (linux be) | HTML entity | Unicode name | LaTeX
← | 0x2190 | 8592 | ^AltGr-Y | &larr; | Leftwards Arrow | \leftarrow \gets
↑ | 0x2191 | 8593 | ^AltGr-Shift-U | &uarr; | Upwards Arrow | \uparrow
→ | 0x2192 | 8594 | ^AltGr-I | &rarr; | Rightwards Arrow | \rightarrow \to
↓ | 0x2193 | 8595 | ^AltGr-U | &darr; | Downwards Arrow | \downarrow
 ⃖ | 0x20D6 | 8406 | | | Combining Left Arrow Above |
 ⃗ | 0x20D7 | 8407 | | | Combining Right Arrow Above |
 ̂ | 0x0302 | 770 | | &#x0302; &#770; | Combining Circumflex Accent |
 ̄ | 0x0304 | 772 | | &#x0304; &#772; | Combining Macron |
 ̅ | 0x0305 | 773 | | | Combining Overline |
 ̲ | 0x0332 | 818 | | | Combining Low Line |
∈ | 0x2208 | 8712 | | &#x2208; &#8712; | Element Of | \in
∉ | 0x2209 | 8713 | | | Not An Element Of | \notin
∊ | 0x220A | 8714 | | | Small Element Of | \smallin
⊂ | 0x2282 | 8834 | | &#x2282; &#8834; | Subset Of | \subset
⊆ | 0x2286 | 8838 | | &#x2286; &#8838; | Subset Of Or Equal To | \subseteq
· | 0x00B7 | 0183 | | | Middle Dot | \cdotp \centerdot

(LaTeX symbol name from package unicode-math)
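
Most columns of the table above can be derived from the character itself, which is a handy way to double-check an entry. The following is a minimal sketch using only Python's standard library; the helper name `describe` is an illustration, not something from this page, and the Keymap and LaTeX columns cannot be derived this way.

```python
import unicodedata


def describe(symbol: str) -> dict:
    """Return the table columns that follow directly from the code point."""
    code_point = ord(symbol)
    return {
        "hex": f"0x{code_point:04X}",          # Unicode (hex) column
        "dec": code_point,                     # Unicode (dec) column
        "entity": f"&#x{code_point:04X};",     # numeric HTML entity
        "name": unicodedata.name(symbol).title(),  # Unicode name column
    }


for symbol in "←∈⊂⊆":
    print(symbol, describe(symbol))
```

The numeric entity always uses the same hex value as the code point, so a row whose entity and hex columns disagree contains a copy-paste error.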

How to find Unicode symbols:

  • shapecatcher.com - Unicode character recognition: draw the symbol with the mouse to find it!
  • In Gnome, see character maps (gucharmap) to get unicode hex code.
  • In KDE, try KDE Character Selector (KCharSelect).
  • In Windows, try Character Map. Use Search for field to search characters with matching description (click Advanced view)

Using Combining symbols:

  • In Windows, try Character Map. Use Search for with Combining to find all combining symbols.
  • Use a Unicode font for best results
  • In Office, Word and Visio seem to give good results. PowerPoint sometimes misaligns the symbols.
    • Tip: Type the letter, then type a space and move cursor back, then type the combining diacritic [1]
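
In text handled by a program, a combining symbol is simply appended after the letter it modifies. The sketch below assumes Python and its standard `unicodedata` module; the helper `with_mark` and the constant names are hypothetical, chosen for this illustration.

```python
import unicodedata

COMBINING_CIRCUMFLEX = "\u0302"  # Combining Circumflex Accent (0x0302)
COMBINING_MACRON = "\u0304"      # Combining Macron (0x0304)


def with_mark(letter: str, mark: str) -> str:
    """Append a combining mark to a letter, rejecting non-combining characters."""
    if unicodedata.combining(mark) == 0:
        raise ValueError(f"U+{ord(mark):04X} is not a combining character")
    return letter + mark


x_bar = with_mark("x", COMBINING_MACRON)      # x with a bar above
p_hat = with_mark("p", COMBINING_CIRCUMFLEX)  # p with a hat above
print(x_bar, p_hat)

# The result is still two code points; it is the font that draws them as one glyph.
print(len(x_bar))

# Where a precomposed character exists, NFC normalization merges the pair.
e_hat = unicodedata.normalize("NFC", with_mark("e", COMBINING_CIRCUMFLEX))
print(e_hat, len(e_hat))
```

Because the pair stays two separate code points for letters such as x, the visual result depends on the font and the application, which is consistent with the misalignment noted above.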

Reference:

Tips

Inserting Unicode symbols in Linux

These are similar to the Windows Alt-key codes.

  • Ctrl-Shift-u, followed by the Unicode hexadecimal code while holding Ctrl-Shift (see [2])
Example: Ctrl-Shift-u e7 for ç, Ctrl-Shift-u e8 for è, Ctrl-Shift-u e9 for é.
  • In Gnome, see character maps (gucharmap) to get the Unicode hex code.
  • In KDE, try KDE Character Selector (KCharSelect).
Inserting Unicode symbols in Windows
  • Alt-KP0 to Alt-KP9: hold the Alt key and type the Unicode character code in decimal on the numeric keypad.
    • For small codes, prefix a 0 to force Unicode instead of ASCII (e.g. Alt-0183 to get Middle Dot)
  • 4-digit hex code, then Alt-X: type the Unicode character code in hexadecimal, then press Alt-X (at least in Office applications)
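
The hexadecimal and decimal codes typed in the shortcuts above map to characters in the same way a program would convert them. This is a small Python sketch of that mapping, not a description of how the keyboard shortcuts are implemented; the function names `from_hex` and `from_decimal` are made up for this example.

```python
def from_hex(code: str) -> str:
    """Character for a hexadecimal code, as typed after Ctrl-Shift-u or before Alt-X."""
    return chr(int(code, 16))


def from_decimal(code: str) -> str:
    """Character for a decimal code, as typed with Alt on the numeric keypad."""
    return chr(int(code, 10))  # a leading zero, as in 0183, does not change the value


print(from_hex("e7"), from_hex("e8"), from_hex("e9"))  # the Linux examples
print(from_decimal("0183"))                            # Middle Dot
print(from_hex("2192"))                                # Rightwards Arrow
```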

Unicode fonts

Some fonts with good unicode support.


On Windows:

  • Arial Unicode MS
  • Cambria Math

All platforms: