Monday, April 28, 2008

Some Important Technology You Never Think About

(1) XML




You've probably heard of XML, but what is it? Where is it?
Though you may never have encountered it directly, XML is everywhere. Now in its tenth year, it has become virtually the lingua franca of data exchange.
XML stands for "extensible markup language" -- extensible because developers can add to it to suit the needs of particular applications. But what makes it really valuable is the fact that it's a language, much like HTML. Unlike some data formats, XML files aren't just streams of incomprehensible numbers. XML is designed to be read by humans as well as machines. A developer who "speaks XML" can look at a document written in an unfamiliar XML dialect and still understand what it's trying to say.
This powerful combination of features makes XML incredibly useful for all kinds of applications. But perhaps its biggest coup was Microsoft's decision to switch to XML-based file formats for Office 2007. As it turns out, you actually may have XML documents sitting on your desktop right now, without realizing it.



(2) Unicode



We use computers for every kind of communication, from IM to e-mail to writing the Great American Novel. The trouble is, computers don't speak our language. They're all digital; before they can store or process text, every letter, symbol, and punctuation mark must first be translated into numbers.
So which numbers do we use? Early PCs relied on a code called ASCII, which took care of most of the characters used in Western European languages. But that's not enough in the age of the World Wide Web. What about Cyrillic, Hindi, or Thai?
Enter Unicode, the Rosetta Stone of computing. The Unicode standard defines a unique number for every letter, symbol, or glyph in more than 30 written languages, and it's still growing. At nearly 1500 pages and counting, it's incredibly complex, but it's been gaining traction ever since Microsoft adopted it as the internal encoding for the Windows NT family of operating systems.
Most of us will never need to know which characters map to which Unicode numbers, but modern computing could scarcely do without Unicode. In fact, it's what's letting you read this article in your Web browser, right now.

No comments: