2
6

J
u
n
e

2
0
0
4

Microsoft Word File Formats

If you’ve ever looked at the document files created by Microsoft Word, you’ll find that the text is there amongst all sorts of hieroglyphics. Furthermore, it also includes some of the text that you’ve previously deleted, which has been a known security issue. Up until today I didn’t know why this was the case, although I suspected it was due to remnants from the ability to have multiple undo functionality.

It turns out that Microsoft Word derives its file format from Bravo and BravoX. Two word processors developed by some guys at Xerox who left en masse to join Microsoft in around 1982-3. The first version of Microsoft Word was essential a port of BravoX to MS-DOS. BravoX stored its files by doing a straight memory dump. The legacy of this file format lives on today in the current versions of Microsoft Word.

To read more about this, and more, visit Bruce Damer’s Personal Histories of the Desktop User Interface.

Leave a Reply

copyright ©2006 and so on, ninthspace.org, except quotations, lyrics and some images which are the rights of their respective holders