Ncursesw

From RogueBasin
Revision as of 18:06, 6 September 2010 by Bear (talk | contribs) (Largely a cut-n-paste from an article I posted on RGRD. Hope this helps people.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Ncursesw

The version of the popular ncurses library that handles wide characters, or Unicode, is surprisingly difficult to get working with C programs. This article is intended to be a checklist for developers so that they can effectively use the library.

As with most development articles, This will be a bit too specific in terms of platform. This article was written with respect to a Linux development platform running Debian Linux. To the extent that your platform is different, there are likely to be important things I don't know about getting development with this library working.

First, you have to be using a UTF-8 locale  (Mine is en_US.UTF-8; I imagine others will have different choices).  Type 'locale' at a shell prompt to be sure.

Second, you have to have a term program that can display non-ASCII characters.  Most of them can handle that these days, but there are still a few holdouts. Terminal and Konsole, popular term programs on Linux, are both good.

Third, you have to use a console font which contains glyphs for the non-ASCII characters that you use.  Again, most default console fonts can handle that these days, but it's still another gotcha, and if you routinely pick some random blambot font to use on the console you're likely to miss out.

Try typing a non-ASCII character at the console prompt just to make sure you see it. If you don't know how to type non-ASCII characters from the keyboard, that's beyond the scope of what's covered here and you'll need to go and read some documentation and possibly set some keyboard preferences. Anyway, if you see it, then you've got the first, second, and third things covered.

Fourth, you have to have ncurses configured to deal with wide characters.   For most linux distributions, that means:     A: your ncurses distribution is based on version 5.4 or later        (mine is 5.7) but *NOT* on version 11.  I have no idea where        version 11 came from, but it's definitely a fork based on        a pre-5.4 ncurses version, and hasn't got the Unicode        extensions.     B: you have the 'ncursesw' versions, which are configured for        wide characters.  How this works depends on your distribution,        but for Debian, you have to get both the 'ncursesw' package        to run ncurses programs that use wide characters and the        'ncursesw-dev' package to compile them.

Fifth, your program has to call "setlocale" immediately after it starts up, before it starts curses or does any output.  If it doesn't call setlocale, your program will remain in the 'C' locale, which assumes that the terminal cannot display any characters outside the ASCII set.  If you do any output or start curses before calling setlocale, you will force your runtime to commit to some settings before it knows the locale, and then setlocale won't have all of the desired effects. Your program is likely to print ASCII transliterations for characters outside the ASCII range if this happens.

Sixth, you have to #define _X_OPEN_SOURCE_EXTENDED in your source before any library #include statements.  The wide character curses functions are part of a standard called the XOPEN standard, and preprocessing conditionals check this symbol to see whether your program expects to use that standard. If this symbol is found, then macroexpansion will configure the headers you include to actually have definitions for the documented wide-character functions.  It's not just the 'ncursesw' header that depends on it; you will get bugs and linking problems with other libraries if you have this symbol defined for some includes but not others, so it's very important to put it before ALL include statements.

Unfortunately, it's not mentioned in the man pages of any of the functions that won't link if you don't do it. You have to hunt through a bunch of not-very-related man pages before you find the only page that mentions it.  

Seventh, you have to refer to the ncursesw header file relative to some spot on your library path.  This can be subtle, because the header files in question have the same names as the standard ncurses header files, so they won't go in the same place.  And of course the man pages are helpful enough to tell you that name but not helpful enough to tell you *WHICH* file by that name you need to use. My library path is basically  /usr/include/ .  The headers for wide ncurses all went into a directory in that folder named ncursesw.  So instead of saying #include <ncurses.h> as the man pages say, I had to say #include <ncursesw/ncurses.h>.  The former refers to the "regular" ncurses.h which doesn't have the wide-character functions and the latter refers to the wide-character versions.  

Eighth, you have to use the -lncursesw compiler option (as opposed to the -lncurses option) when you're linking your executable.  

Once you cover these eight points, you should be able to develop a roguelike game in C with Unicode characters using the ncursesw library.