The A.C.O.R.N. Lexicon.
The A.C.O.R.N. lexicon is an Open Source lexicon designed for computational linguistics research. Other computational linguistics resources are listed here.
Original and Official Project Gutenberg Web Site
Project Gutenberg provides freely-redistributable electronic texts. When possible, we make use of Project Gutenberg resources and donate our work back to Project Gutenberg.
The Online Books Initiative (OBI) is a large collection of text and related materials. (A mirror is also available).
UPenn's Online Books page features over 10000 online books free to the public.
Another collection of electronic texts, including a mirror of the archive formerly hosted at quartz.rutgers.edu (one of the largest pre-WWW (1990-1994) electronic text archives).
The Moby Lexicon Project, including words lists, hyphenation and pronounciation data, and a thesaurus, is complete and has been placed in the public domain.
Word lists based on Moby and other large databases for various special interests. Also links to many other projects and word lists.
I'm tired of people sending the webmaster email about this, so I wrote up a page of urls.
The ONline Information eXchange Guidelines consist of data elements that can be used to describe the contents of an electronic work. This description is for meta data (e.g., title, author, etc.), not the actual body of the work.