Anyone know where to look? What obstacles or advantages do the character sets give these languages?
A simple Python program:

    import sys
    filenames = sys.argv[1:]
    def fixline(filename):
        text = open(filename).read()
        text = text.replace('\n\r','\n')
        return text
    for file in filenames:
        print fixline(file)

and its translation:

    載入 系統
    文件名 = 系統.參數[1:]
    定義 修正行尾(文件):
        內文 = 打開(文件名).讀入()
        內文 = 內文.替換('\n\r','\n')
        傳回 內文
    取 文件 自 文件名:
        寫 修正行尾(文件)

Is it the same for Cyrillic or German or Spanish for that matter?
Blown. Away.
Besides CJK, there are several programming languages that are based on non-English languages [2]. A few are even widely used in industry.
[1] http://www.chinesepython.org/doc/sample/ask.html
[2] http://en.wikipedia.org/wiki/Non-English-based_programming_l...
If you're talking about code written in non-English languages, Peter Cooper has a fun article about writing Ruby code in UTF-8: http://www.rubyinside.com/the-split-is-not-enough-whitespace...
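For what it's worth, Python 3 gets you part of the way there too: identifiers may contain non-ASCII letters (PEP 3131), although keywords and built-ins stay in English. A minimal sketch, not anyone's official translation; the function name is borrowed from the sample upthread, and the '\r\n' (Windows-style) line ending is my assumption:

```python
# Python 3 only: non-ASCII identifiers are allowed, keywords remain English.
# The names below are illustrative, reusing those from the sample upthread.

def 修正行尾(內文):
    """Return the text with '\\r\\n' line endings replaced by '\\n'."""
    # Assumption: the input uses Windows-style '\r\n' endings.
    return 內文.replace('\r\n', '\n')

結果 = 修正行尾('a\r\nb\r\n')
print(repr(結果))  # prints 'a\nb\n'
```

So you can name things in any script, but `def`, `return` and `print` still have to be typed in English.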
After his presentation, during the Q&A, someone asked Matz for help with their program. Matz walked over, crouched next to this young developer, and looked over their code. He gave some feedback, then walked back up to the podium and went on answering questions.
Really nice person.