website/content/research/progcomp/strings.md

16 lines
798 B
Markdown
Raw Normal View History

2020-01-15 21:51:49 -05:00
# Strings
## Character Codes
Character codes are mappings between numbers and symbols which make up a particular alphabet.
The *American Standard Code for Information Interchange* (ASCII) is a single-byte character code where $2^7 = 128$ characters are specified.
Symbol assignments were not done at random. Several interesting properties of the design make programming tasks easier:
- All non-printable characters have either the first three bits as zero or all seven lowest bits as one. This makes it easy to eliminate them before displaying junk.
- Both the upper and lower case letters and the numerical digits appear sequentially
- We can get the numeric order of a letter by subtracting the first letter
- We can convert a character from uppercase to lowercase by $Letter - "A" + "a"$