Questions tagged [character-set]

Refers to text input character sets such as Unicode or ASCII.

What Questions Should Have This Tag?:

Questions include:

  • Questions about the character sets in Ubuntu, using them, installing them, etc.

Basic Definitions:

Character Encoding - In computing, a character encoding is used to represent a repertoire of characters by some kind of an encoding system.


Brief Introduction To The Subject:

In computer science, the terms "character encoding", "character map", "character set" and "code page" were historically synonymous, as the same standard would specify a repertoire of characters and how they were to be encoded into a stream of code units – usually with a single character per code unit. The terms now have related but distinct meanings, reflecting the efforts of standards bodies to use precise terminology when writing about and unifying many different encoding systems. Regardless, the terms are still used interchangeably, with character set being nearly ubiquitous.


Important Links For Learning More:

38 questions
16
votes
3 answers

How to write en and em dashes?

I understand that to be able to type en and em dashes I have to configure the COMPOSE key, or something like that, but I am not sure about that nor on how I get from there to be able to type en and em dashes in addition to the minus sign. How do I…
user364819
15
votes
3 answers

Opening a non-utf8 encoded text file

I sometimes need to open text files that are encoded in EUC-KR. man gedit gives: --encoding Set the character encoding to be used for opening the files listed on the command line. This can be used to open specific…
user85023
6
votes
2 answers

Why Chinese for my uploads?

Didn't use to happen. Now happens with any upload of csv data, regardless of content (and it isn't Chinese!). I checked cat /etc/default/locale LANG="en_US.UTF-8" My language settings are all English
Michael Durrant
  • 10,666
  • 21
  • 60
  • 81
5
votes
3 answers

How to print the ■ character in linux terminal using C?

This char is 254 in ASCII Extended Table, and 25A0 in Unicode. If I run putchar(254) the terminal does not recognize the char, as I think it utilizes not extended ASCII.
5
votes
1 answer

Ubuntu 20.04 how to remove duplicate packages after upgrade

I noticed that in My Ubuntu 20.04, I have some duplicate packages (from Ubuntu 18.04): Calculator, System monitor, etc. For instance, when I search for System monitor, here is the result: In addition, the old packages appear with…
user545149
4
votes
3 answers

How to print an octal value's corresponding UTF-8 character in bash?

I expected printf %s '\' to do the trick, but it doesn't: printf %s '\101' Outputs: \101
kos
  • 35,535
  • 13
  • 101
  • 151
4
votes
1 answer

Cedilla no longer working with US Intl. keyboard layout

I'm using Ubuntu 20.04.4 LTS with the US Intl. keyboard layout, but when I type ', then c, I'm now getting a ć instead of the regular ç. How can I fix this?
3
votes
1 answer

What is the "Character Map" application used for?

Ubuntu comes pre-installed with an application called "Character Map". In the terminal, it can be launched as "charmap". What is this application typically used for? What function does it provide people that Ubuntu sees as so necessary as to include…
Anon
  • 11,863
  • 22
  • 66
  • 122
2
votes
2 answers

Unable to change gnome-terminal default character encoding

I have tried everything that I could find on the internet. $> gconftool-2 --type string --set /apps/gnome-terminal/profiles/Default/encoding "en_US.UTF-8" $> cat /etc/environment ... LC_ALL="en_US.UTF-8" $>…
shikhanshu
  • 131
  • 1
  • 4
2
votes
1 answer

how to display chinese character in txt file on ubuntu 16.04

I tried some methods to fix this issue,but no lucky.I can't find the auto-detected item to configure it.So what I should do next,I need help,and thanks a lot!
Patrick
  • 21
  • 2
2
votes
0 answers

Cannot enter Latin-1 characters in tmux

After upgrading to Ubuntu 16.04, I cannot enter Latin-1 characters (like Umlaute ä etc.) in tmux. I tried the packaged tmux, self-installed tmux 2.1 and 2.2. Furthermore, if I 'view' a file in tmux, they just disappear. My settings…
Sven
  • 179
  • 6
2
votes
0 answers

How to avoid strange characters with Internet downloaded file names

I use Ubuntu Studio 15.10, in my desktop PC, and Lubuntu 15.10, in my laptop. With both OS, I get strange characters into the downloaded file names from Internet. I mean, file names with spanish (latin) characters, originally. For example: if the…
Juan
  • 1,737
  • 6
  • 27
  • 61
2
votes
1 answer

Why do Filenames created under UTF-8 not match the UTF-Codepage layout?

According to Wikipedia's utf-8 Codepage Layout, its character-set is as follows: ! " # $ % & ' ( ) * + , - . / 0-9 : ; < = > ? @ A-Z [ \ ] ^ _ ` a-z { | } ~ But (excepting /), filenames beginning with these character are sorted by name…
Kraus
  • 41
  • 1
  • 1
  • 3
2
votes
1 answer

Custom Keyboard Variant (for Transliteration of Arabic) not working

I tried setting up a custom keyboard layout in Ubuntu 18.04.1 LTS for being able to write Arabic transliteration for scientific purposes (see: https://en.wikipedia.org/wiki/Romanization_of_Arabic ; https://en.wikipedia.org/wiki/DIN_31635 ) Therefore…
user873389
2
votes
1 answer

expect interferes with character encoding during ssh

In my company we use expect to automate tasks, when ssh'ing into other systems. All our legacy systems run ISO-8859-1 character encoding, while our desktop systems run UTF-8 encoding. Which of course poses certain challenges when ssh'ing from one of…
Jens Bang
  • 23
  • 6
1
2 3