A document (noun) is a bounded physical or digital representation of a body of information Information, in its most restricted technical sense, is an ordered sequence of symbols. As a concept, however, information has many meanings. Moreover, the concept of information is closely related to notions of constraint, communication, control, form, instruction, knowledge, meaning, mental stimulus, pattern, perception, and representation designed with the capacity (and usually intent) to communicate Communication is a process of transferring information from one entity to another. Communication processes are sign-mediated interactions between at least two agents which share a repertoire of signs and semiotic rules. Communication is commonly defined as "the imparting or interchange of thoughts, opinions, or information by speech, writing,. A document may manifest symbolic A symbol is something such as an object, picture, written word, sound, or particular mark that represents something else by association, resemblance, or convention. For example, a red octagon may be a symbol for "STOP". On maps, crossed sabres may indicate a battlefield. Numerals are symbols for numbers . All language consists of symbols, diagrammatic or sensory-representational information. To document (verb) is to produce a document artifact by collecting and representing information. In prototypical usage, a document is understood as a paper artifact, containing information in the form of ink marks. Increasingly documents are also understood as digital artifacts.
Colloquial usage is revealed by the connotations and denotations that appear in a Web search for document. From these usages, one can infer the following typical connotations:
- Writing that provides information person's thinking by means of symbolic marks.
- A written account of ownership or obligation.
- To record in detail; "The parents documented every step of their child's development".
- A digital file in a particular format.
- To support or supply with references; "Can you document your claims?".
- An artifact that meets a legal notion of document for purposes of discovery in litigation.
- Document is the practical construct for describing matter Matter is a general term for the substance of which all physical objects are made. Typically, this includes atoms and other particles which have mass. However in practice there is no single correct scientific meaning; each field uses the term in different and often incompatible ways. A common way of defining matter is as anything that has mass and in different forms which retain information Information, in its most restricted technical sense, is an ordered sequence of symbols. As a concept, however, information has many meanings. Moreover, the concept of information is closely related to notions of constraint, communication, control, form, instruction, knowledge, meaning, mental stimulus, pattern, perception, and representation for a reasonable period of time wherein it can be perceived In philosophy, psychology, and cognitive science, perception is the process of attaining awareness or understanding of sensory information. The word "perception" comes from the Latin words perceptio, percipio, and means "receiving, collecting, action of taking possession, apprehension with the mind or senses." by a sentient Sentience is the ability to feel or perceive. The term is used in science and philosophy, and in the study of artificial intelligence. Sentience is used in the study of consciousness to describe the ability to have sensations or experiences, known to Western philosophers as "qualia". In eastern philosophy, sentience is a metaphysical observing entity An entity is something that has a distinct, separate existence, though it need not be a material existence. In particular, abstractions and legal fictions are usually regarded as entities. In general, there is also no presumption that an entity is animate. Entities are used in system developmental models that display communications and internal.
The variety usage reveals that the notion of document has rich social and cultural aspects besides the physical, functional and operational aspects.
- Document is just a practical concept which presently would be defined narrowly based on human understanding and perception of the external world.
- Document in its wider connotation could include matter in all its forms, even a universe could be perceived as a document on a wider scale.
- The practical construct requires the retention of information but the relevance of the information (utility, value are not decided as these depend upon the objectives of the user and the purpose for which he accesses the information)
- The information must also be with reference to the observing entity be retained for a reasonable period of time wherein it can be observed. Fleeting images which cannot be seen are almost as if never observed.
Conceptualization in analytical philosophy
The notion of document admits both an empirical (in terms of a fuzzy set of real-world instances) and analytical characterization. The analytical characterization hinges on the semantic character of the word document, as well as the use of a primitive notion of document in accounts of larger communication constructs such as discourses, or related constructs such as language games.
The nominal 'document', like other nominals, exhibits familiar patterns of polysemy (a kind of ambiguity). For example, "document" might be used on an occasion to denote a certain body of information independently of how that information is physically rendered (as in 'the Bible is my favorite document.'; 'Have you finished reading all the documents for Monday's class yet?'), or it might be used to denote a particular physical instantiation of a body of information (as in 'that document is worn and needs to be re-bound.'; 'Return the documents you borrowed to the reference desk.'). This kind of polysemy bears some similarity to what Nunberg, 1979 termed "container/contents polysemy" (as in 'Mary broke the bottle' versus 'the baby finished the bottle'). These patterns of polysemy exhibited by 'document' matter for the following reason. A certain document qua body of information (e.g. the Bible, not a particular bound copy thereof) will have different properties than a document qua physical rendering of a body of information (e.g. a particular bound copy of the Bible). Importantly, the latter would have the property of being a static, physically bounded thing. The former would have the properties of being able to evolve over time, being susceptible of certain changes to information content, and being capable of supporting multiple physical instantiations that have allowable differences in information content. This distinction is relevant to the discussion of aspects and history of documents below.
Empirical characterization
In light of the polysemy Polysemy (from the Greek: πολυ-, poly-, "many" and σήμα, sêma, "sign") is the capacity for a sign (e.g., a word, phrase, etc.) or signs to have multiple meanings (sememes), i.e., a large semantic field. This is a pivotal concept within social sciences, such as media studies and linguistics of the core concept of document, it is useful to note a number of examples ranging from instances commonly understood as prototypical A prototype is an original type, form, or instance of something serving as a typical example, basis, or standard for other things of the same category. The word derives from the Greek πρωτότυπον , "primitive form", neutral of πρωτότυπος (prototypos), "original, primitive", from πρῶτος (protos), " documents, to instances that are understood as documents only in specialized or rare situations.
- Prototypical Documents: Letters, memos, legal forms, owners manual An owners manual is an instructional book or booklet that is supplied with almost all technologically advanced consumer products such as vehicles, home appliances and computer peripherals. Information contained in the owners manual typically includes:
- Documents of Record: Newspapers A newspaper is a regularly scheduled publication containing news, information, and advertising. By 2007 there were 6580 daily newspapers in the world selling 395 million copies a day (55 million in the U.S). The worldwide recession of 2008, combined with the rapid growth of web-based alternatives, caused a serious decline in advertising and, magazines Magazines, periodicals, glossies or serials are publications, generally published on a regular schedule, containing a variety of articles, generally financed by advertising, by a purchase price, by pre-paid magazine subscriptions, or all three. Magazines can be distributed through the mail; through sales by newsstands, bookstores or other vendors;
- Books: Textbooks A textbook or coursebook is a manual of instruction in any branch of study. Textbooks are produced according to the demands of educational institutions. Although most textbooks are only published in printed format, many are now available as online electronic books and increasingly in scanned format in P2P networks, novels A novel is a long narrative in literary prose. The genre has historical roots both in the fields of the medieval and early modern romance and in the tradition of the novella. The latter supplied the present generic term in the late 18th century, cookbooks A cookbook is a book that contains information on cooking. It typically contains a collection of recipes, and may also include information on ingredient origin, freshness, selection and quality, encyclopedias An encyclopedia is a type of reference work, a compendium holding information from either all branches of knowledge or a particular branch of knowledge, comic books A comic book is a magazine made up of narrative artwork in the form of separate "panels" that represent individual scenes, often accompanied by dialog (usually in word balloons, emblematic of the comic book art form) as well as including brief descriptive prose. The first comic book appeared in the United States of America in 1934,
- Canonical Documents: Code of law A Code is a type of legislation that purports to exhaustively cover a complete system of laws or a particular area of law as it existed at the time the code was enacted, by a process of codification. Though the process and motivations for codification are similar in common law and civil law systems, their usage is different. In a civil law country,, statute A statute is a formal written enactment of a legislative authority that governs a state, city, or county. Typically, statutes command or prohibit something, or declare policy. The word is often used to distinguish law made by legislative bodies from case law and the regulations issued by government agencies. Statutes are sometimes referred to as, constitution A constitution is a set of laws that a set of people have made and agreed upon for government—often codified as a written document—that enumerates and limits the powers and functions of a political entity. These rules together make up, i.e. constitute, what the entity is. In the case of countries and autonomous regions of federal countries the, religious text Religious texts, also known as scripture, are the texts which various religious traditions consider to be sacred, or of central importance to their religious tradition. Many religions and spiritual movements believe that their sacred texts are divinely or supernaturally inspired
- Transactional Documents: Cheques A cheque or check is a piece of paper (usually) that orders a payment of money. The person writing the cheque, the drawer or maker, usually has a chequing account where their money is deposited. The maker writes the various details including the money amount, date, and a payee on the cheque, and signs it, ordering their bank, know as the drawee,, contracts In law, a contract is an agreement between two or more parties which, if it contains the elements of a valid legal agreement, is enforceable by law or by binding arbitration. That is to say, a contract is an exchange of promises with specific legal remedies for breach. These can include Compensatory remedy, whereby the defaulting party is required, medical prescriptions A prescription is a health-care program implemented by a physician or other medical practitioner in the form of instructions that govern the plan of care for an individual patient. Prescriptions may include orders to be performed by a patient, caretaker, nurse, pharmacist or other therapist. Commonly, the term prescription is used to mean an order, receipt A receipt is a written acknowledgement that a specified article or sum of money has been received as an exchange for goods or services. The receipt acts as the title to the property obtained in the exchange[citation needed], forms A form is a document with spaces in which to write or select, for a series of documents with similar contents. The documents usually have the printed parts in common, possibly except for a serial number. Advantages of forms include:, Postage stamps A postage stamp is adhesive paper evidence of a fee paid for postal services. Usually a small rectangle attached to an envelope, the stamp signifies the person sending it has fully or partly paid for delivery. Postage stamps are the most popular way of paying for retail mail; alternatives include postal stationery such as prepaid-postage envelopes,
- Functional Documents: Portable Document Format Portable Document Format is a generic computer term.[citation needed] The best-known PDF implementation is Adobe PDF, a file format created by Adobe Systems in 1993 for document exchange. The remainder of this article discusses Adobe PDF exclusively (PDF) files, PostScript PostScript is a dynamically typed concatenative programming language created by John Warnock and Charles Geschke in 1982. PostScript is best known for its use as a page description language in the electronic and desktop publishing areas files, XML XML is a set of rules for encoding documents in machine-readable form. It is defined in the XML 1.0 Specification produced by the W3C, and several other related specifications, all gratis open standards files, email Electronic mail, commonly called email or e-mail, is a method of exchanging digital messages across the Internet or other computer networks. Email systems are based on a store-and-forward model in which email server computer systems accept, forward, deliver and store messages on behalf of users, who only need to connect to the email infrastructure,
- Non-Prototypical Documents: Post-it notes A Post-it note is a piece of stationery with a re-adherable strip of adhesive on the back, designed for temporarily attaching notes to documents and to other surfaces: walls, desks, computer displays, and so forth. While now available in a wide range of colors, shapes, and sizes, Post-it Brand notes are most commonly a 3-inch square, canary yellow, fortune cookie A fortune cookie is a crisp Asian cookie usually made from flour, sugar, vanilla, and oil with a "fortune" wrapped inside. A "fortune" is a piece of paper with words of faux wisdom or a vague prophecy. In the United States and Canada , it is usually served with Chinese food in Chinese restaurants as a dessert. The message strips, maps A map is a visual representation of an area—a symbolic depiction highlighting relationships between elements of that space such as objects, regions, and themes, paintings Painting is the practice of applying paint, pigment, color or other medium to a surface . The application of the medium is commonly applied to the base with a brush but other objects may be used. In art the term describes both the act and the result which is called a painting. Paintings may have for their support such surfaces as walls, paper,, milk cartons, cereal boxes
- Non-Classical Digital Documents: Web pages A web page or webpage is a document or resource of information that is suitable for the World Wide Web and can be accessed through a web browser and displayed on a monitor or mobile device, blogs A blog is a type of website or part of a website. Blogs are usually maintained by an individual with regular entries of commentary, descriptions of events, or other material such as graphics or video. Entries are commonly displayed in reverse-chronological order. "Blog" can also be used as a verb, meaning to maintain or add content to a, wikis Wikis may exist to serve a specific purpose, and in such cases, users use their editorial rights to remove material that is considered "off topic." Such is the case of the collaborative encyclopedia Wikipedia. In contrast, open purpose wikis accept content without firm rules as to how the content should be organized
- Boundary Examples: The Pioneer plaque The Pioneer plaques are a pair of gold-anodized aluminum plaques which were placed on board the 1972 Pioneer 10 and 1973 Pioneer 11 spacecraft, featuring a pictorial message, in case either Pioneer 10 or 11 are intercepted by extraterrestrial life. The plaques show the nude figures of a human male and female along with several symbols that are on the Pioneer 11 Pioneer 11 was the second mission of the Pioneer program (after its sister probe Pioneer 10) to investigate Jupiter and the outer solar system, and the first to explore Saturn and its main rings. Pioneer 11 used Jupiter's mass in a gravity assist to alter its trajectory toward Saturn. The unmanned spacecraft was developed by NASA Ames Research spacecraft, designed by astronomer Carl Sagan Carl Edward Sagan (November 9, 1934 – December 20, 1996) was an American astronomer, astrophysicist, author, cosmologist, and highly successful popularizer of astronomy, astrophysics and other natural sciences. During his lifetime, he published more than 600 scientific papers and popular articles and was author, co-author, or editor of more than, and using information assumed to be universal is an extreme example of a document that is intended to communicate with aliens. Conversely, the recorded and printed signals of the SETI project would constitute documents if they were discovered to contain alien communication.
Social aspects of documents
Documents play a key role in the construction of social reality (Searle, 1996) and therefore play a part in accounts of every important aspect of human society and culture. An example of this type of account is in the seminal account of the role of print in political evolution, Imagined Communities, (Anderson, B., 2006). More direct examples include the works of Marshall McLuhan Herbert Marshall McLuhan, CC was a Canadian educator, philosopher, and scholar—a professor of English literature, a literary critic, a rhetorician, and a communication theorist. McLuhan's work is viewed as one of the cornerstones of the study of media theory (McLuhan, 1964 and 1969). Many key social aspects of documents arise from their historically unchanging character. This aspect leads to a definition of a document as a talking thing (Levy, D., 2003), whose strengths and weaknesses both arise from its relative (historical) immutability with respect to oral forms of communication. The relative immutability of documents has thus historically been important for establishing a record of transient events, or for preserving information whose precise linguistic form is of ritual or practical importance (such as religious texts or legal documents). Note though, that historically many societies have accorded greater authority to disciplined oral traditions as more reliable than parallel written ones. With this caveat in mind, the following social aspects of information may be noted.
- Social Value: The information in documents as well as documents themselves are often valuable; the information because of the influence represented, and the document itself when it is believed to be a rare or unique and authentic representation of the information it contains.[citation needed]
- Manifestation of authority: Documents are often produced to provide a record that will be considered authoritative in the future, particularly with respect to government. Consider receipts, titles, and deeds as examples of proof of ownership, and passports or driver's licenses as proof of identity.
- Conventional: Documents inherit a key feature of language-based communication in general: they are denoted as documents by convention (Lewis, 2002). Virtually any medium can constitute a document provided the people involved can agree on the meaning represented. Hence cave drawings, hieroglyphics, scrolls of sheepskin, sheets of papyrus, ink on paper, magnetic tape and electronic files are all documents under certain accounts.
- Manifestation of economic labor: Historically, the effort required to produce a document has been significant, so only the most important documents were created. The Illuminated manuscript An illuminated manuscript is a manuscript in which the text is supplemented by the addition of decoration, such as decorated initials, borders and miniature illustrations. In the strictest definition of the term, an illuminated manuscript only refers to manuscripts decorated with gold or silver, but in both common usage and modern scholarship, the of the pre-Gutenberg era demonstrates the cost (and associated imputed value) of documents. Historically, the cost of producing documents has declined, while their functional characteristics ("affordances" in the sense of Sellen and Harper, 2001) have become richer.
- Manifestation of business processes: Documents play many roles in the internal management of a business as well in the interfaces between businesses and their suppliers, employees, and customers. Current trends toward longer value chains and increased regulation increase the number of documents that must be generated and processed.
- Instruments of Governance and Law: The unchanging aspect of documents is crucial to the consistent communication of policy and administration of law to citizens. Documents that play such roles include constitutions, corporate annual reports and religious texts.
- Analytical philosophical character: The notion of document plays a role in political philosophy (example, the notion of social contract as a primitive construct), as well as in the philosophy of law
- Role in Religion: Documents play a key role in religion, and constitute canonical content. Document-related terms such as dogma and doctrine have today acquired pejorative connotations primarily due to historical events associated with religious documents.
- Cultural Significance: Documents play a central role in art of all varieties. In the movie Office Space Office Space is a 1999 American comedy film written and directed by Mike Judge. It satirizes work life in a typical 1990s software company, focusing on a handful of individuals who are fed up with their jobs. The film's sympathetic portrayal of ordinary IT workers garnered it a cult following among those in that profession, but the film also for instance, central plot elements are frustration with bureaucratic process involving the fictional "TPS reports" and a malfunctioning printer In computing, a printer is a peripheral which produces a hard copy of documents stored in electronic form, usually on physical print media such as paper or transparencies. Many printers are primarily used as local peripherals, and are attached by a printer cable or, in most newer printers, a USB cable to a computer which serves as a document.
- Metaphoric Significance: Metaphors based on documents permeate our thinking, ranging from the obvious ("let's start with a clean sheet for this design", "this is a new chapter in my life" and "she wrote the book on that") to the highly allegorical ("All mankind is of one author, and is one volume; when one man dies, one chapter is not torn out of the book, but translated into a better language; and every chapter must be so translated" — John Donne).
Functional characteristics
Documents also manifest several, more localized characteristics that determine how we use them in everyday life:
- Manifest nature: Information is physical, i.e. it always must exist in a tangible form, even when digital. IBM computer scientist Rolf Landauer Rolf William Landauer was an IBM physicist who in 1961 argued that when information is lost in an irreversible circuit, the information becomes entropy and an associated amount of energy is dissipated as heat. This principle is relevant to reversible computing, quantum information and quantum computing is credited with this observation and working out its implications. By virtue of being realizations of chunks of information, documents are necessarily physical in all their forms.
- Contextuality and Situatedness: All communication takes place in a context, which includes at least the shared understanding of the parties communicating (Lewis, 2002). Explicit and implicit references to the context can convey a large amount of meaning by building on the shared understanding, but that meaning is lost to another party that does not share that context. For example, Shakespeare in the original would be incomprehensible to modern readers simply because of the evolution of language and spelling since the seventeenth century, and modern readers (besides Shakespeare scholars) normally read modernized versions. Similarly, hypertext documents exist in a context which is lost if printed, leading to a different offline reading context.
- Evolvability: When we think of a document as a definitive source containing the best known information about a topic there is need to change that information as more is learned. This is frequently done by revising the document into a new version or edition. Typically, older versions are archived to facilitate understanding how the document has changed. In modern contexts, when technologies such as wikis or software source code are under discussion, this evolvability can require very sophisticated version control technologies.
- Renderability: Every abstract entity An abstract object is an object which does not exist at any particular time or place, but rather exists as a type of thing . In philosophy, an important distinction is whether an object is considered abstract or concrete. Abstract objects are sometimes called abstracta (sing. abstractum) and concrete objects are sometimes called concreta (sing that is understood to be a document in some context can be rendered, often in more than one way. A rendition of a document refers to a particular physical or electronic representation of the information from the document. For example, a portable document format (pdf Portable Document Format is an open standard for document exchange. The file format created by Adobe Systems in 1993 is used for representing two-dimensional documents in a manner independent of the application software, hardware, and operating system. Each PDF file encapsulates a complete description of a fixed-layout 2D document that includes) representation and a web page may contain the same information but have substantially different properties and appearances. We think of them as different renditions (or renderings) of the same document. We might similarly consider different translations of a document to be the same document although differences in language context and structure may make it impossible to express precisely the same meaning in both languages.
- Affordances: Documents in digital and physical forms manifest various "affordances" (Sellen and Harper, 2001, Gladwell, 2002)). The affordances of a particular rendition of a document determine its uses. For example, paper has the affordances of allowing flipping and easy tactile manipulation, while digital forms are easier to edit.
Classical roles and workflows in document production
There are a number of roles in which people are involved in the creation and distribution of traditional paper documents (Romano, 1989); some, but not all documents are processed by people acting in each role, each of which may be performed by an individual or a group. Books are a well known example of documents that require an extensive publication process, but many other documents undergo similar processes to at least some of those from book publication. Each of these roles is considered to improve or add value to a document. These roles are generally understood as being clustered in various phases in the production of a classical document, including authorship, editing and prepress. Roles and workflows in the production of modern digital documents are more variable and are discussed in the section on future documents.
- An author An author is broadly defined as "the person who originates or gives existence to anything" and that authorship determines responsibility for what is created. Narrowly defined, an author is the originator of any written work selects the content to be communicated and performs the initial organization and recording of the content. A document in this state is often called a manuscript.
- A reviewer reads the content and evaluates it with respect to the intended audience. Reviewers often recommend only the best documents to be published. Documented reviews are frequently published as guidelines for document consumers as well.
- An editor helps to organize and express the content so that the meaning is clear and understandable, and follows the conventions of the symbolic representation such as spelling and grammar.
- A publisher orchestrates the process of producing a document, often decides whether a document is worth the effort of publishing (usually an economic decision), and collects and disseminates the profits from sales of a produced document.
- A printer formats the document into a comfortable form such as a bound book. Printing can be a very complex and elaborate process, including
- pagination - function performed by an individual who takes on the tasks of organizing text, fonts, images, headings, footnotes, chapters and sections to accommodate the physical constraints of a printed page aesthetically.
- pre-press—function performed by print shops in preparing paper documents for production.
- imposition - organizing desired pages on a larger media such that when folded and trimmed the pages will be upright and in order.
- printing - marking paper with ink or toner
- folding pages into sections
- binding pages together and covering
- trimming
- packaging
- A distributor manages inventory and physical distribution of printed documents to retailers.
- A retailer manages a local inventory and sales to consumers, and often is familiar with the content and can make appropriate recommendations.
- A librarian organizes, tracks borrowing of, and archives documents.
A publication process enables a consumer to purchase or borrow, read and learn from documents. Consumers are often the intended audience of the publication process.
Document production technology
Document production technology has evolved significantly through history. While a great deal can be said about ancient production technologies including papyrus, palm leaves, stone tablets and marking devices ranging from quills to chisels, the modern form of the document has evolved largely under the influence of printing technologies. The Illuminated manuscript of Europe is a useful prototypical instance of the document at the end of its evolution before the widespread use of printing. The associated technology was largely a human one. Other cultures at this stage used other forms of pre-print era documents. The history of printing can be traced as follows:
Bronze Age civilizations made extensive use of seals for commercial and transactional purposes. The particular case of the signet ring was of particular importance, and is still in use in place of signatures in East Asian countries like Korea, where it is common for individuals to carry a seal.
Chinese Woodblock printing was the first widespread technology that automated important parts of the document production process.
The Gutenberg Printing Press (McLuhan, M., 1969) enabled the mass production of faithful copies of documents, and hence the widespread dissemination of information. The widespread access to information enabled (and necessitated) fundamental changes to society in religion, government, law, business, and entertainment. Prior to the press the huge effort required to faithfully hand-copy severely limited the number of documents available, and hence access to the information contained therein. The effort to set type and prepare a document for reproduction was still high, but many high fidelity copies could be produced.
The development of Lithography constituted the next great advance in document production technology and continues today to dominate the economic landscape of document production, an economic sector estimated to be of the order of $1 trillion. Lithography brought economies of scale and extremely high quality and low cost to documents.
The typewriter improved the accessibility of document production technologies and enabled it to enter mainstream workplaces. Carbon paper enabled a modest number of copies to be produced concurrently with the original. A brief era of photography-based technologies flourished (including the photostat and cyclostyle processes) in parallel with the age of typewriters.
The Xerox Copier became a major milestone in document production by eliminating the typesetting effort required by a printing press. The Xerographic ("dry writing") technology (also referred to as electrophotography) could produce durable and economical copies of a paper document easily and quickly. Modern digital printers from Xerox and other companies such as HP, Canon and Ricoh, can produce more than 240 black and white or 170 copies of a page each minute, and work with up to 6 colors and dry and wet inks. This technology supports a $100 billion market in digital printing, particularly in domains where lithography has clear limitations.
Computers enabled information to be stored electronically in databases and electronic files on magnetic tapes, drums, and disks. This led to a radical disruption of all document production technologies. Initially most of this information was printed onto paper by teletypes (automated typewriters), but computer printers rapidly became faster and more sophisticated. Computers, by controlling lasers in xerography, micro-nozzles in inkjet systems, and tiny solenoids in mechanical systems, became capable of being serially embedded in the document production process. Computers are also critical to modern lithography.
A whole interaction style with computers was developed around the metaphor of working with documents and folders on a desktop, to the point that the word document is now commonly associated with the information stored in a computer file according to the metaphor. Today, electronic paper is viewed as one potential future evolutionary physical form of the prototypical document, as it can present the electronic document with the readability of printed paper.
Document life cycle management technology
Technology to manage documents has evolved in parallel with documents themselves. Of particular importance are practices concerning the preservation, archival, destruction and management of documents. These constitute what is known as the "document life cycle"
- Physical preservation: Documents in both traditional physical forms and in digital physical forms such as magnetic media must be physically preserved. This aspect of document management deals with such issues as the aging of paper (the innovation of acid-free paper is an advance in preservation) and obsolescence of magnetic media.
- Storage: This aspect includes management of scarce resources such as shelf space and disk space, and associated technologies such as optimal space utilization. Modern libraries such as the University of Nevada and the University of Michigan often use complex space-saving technologies such as robotic retrieval systems for stacks and moving bookshelves. In the digital realm, the entire discipline of compression technologies can be viewed as concerned with the storage of documents.
- Cultural Preservation: This function, traditionally ascribed to librarians involves the selection, arrangement and storage of documents in safe places. The importance of this part of document life cycle management can be seen in the impact of historical events such as the destruction of books in ancient China[citation needed] and the burning of the library at Alexandria. Today, library and information science has evolved into an important academic discipline.
- Bibliometrics: This aspect of document management involves functions of indexing, generating statistics and taxonomies, and improving the usability of large collections of documents. The modern history of this management technology dates back to Melvil Dewey and the Dewey Decimal System. Today, the science of bibliometrics is largely concerned with managing the impact of electronic technologies. This aspect must also deal with ISBN numbers, Library of Congress data and other standards.
- Digital Content Management: The explosion of digital content has resulted in technologies to manage large collections of digital information generated by organizations. Such systems must manage access control and privileges, multiple electronic format, interface with printing infrastructures and enable collaborative work flows around documents.
- Digital-Physical Interaction Management: As long as both paper and digital documents continue to have value, the modern management technologies to manage their interaction will continue. Key to this management is the management of large scale and systematic scanning of physical documents (such as the Google book scanning project).
- Destruction: With the increased cost of identity theft, corporate scandals and privacy concerns, the destruction of both paper and electronic documents has become increasingly important to manage. Technologies such as shredders play a role, as do verifiable processes of destruction of electronic documents to ensure compliance with privacy laws.
- Security: Shannon's information theory has led to an entire discipline that concerns itself with the security of documents, and associated technologies such as encryption, as well as more physical security features such as watermarks and making currency documents safe from counterfeiting.
- Transportation: The entire postal system, as well as modern courier systems, is largely built on the need to move documents physically from one location to the other.
The document economy
The economics of the production and management of documents indirectly impacts every economic sector. While the total economic value of the document economy is hard to estimate, the economic sectors with business models directly dependent on documents include:
- Document Authoring Technology: This sector supports a huge variety of digital and physical production technologies, ranging from Microsoft Word to LaTeX to advanced layout software.
- Education: The production and processing of documents is so critical that entire educational disciplines have evolved around writing, editing, layout and design of documents. The information sciences are also part of the document economy.
- Electronic Document Management: Managing documents within organizations and in public and personal contexts supports a huge industry in content management systems, ranging from free public infrastructure such as wikipedia to proprietary enterprise applications such as Docushare and Documentum.
- Physical Document Management: Large manufacturing sectors producing everything from 3-ring binders to filing cabinets and office desks exist largely due to the need to process documents.
- Media: The paper industry exists to support the document economy.
- Print equipment: From lithography and xerography to pencils and crayons, an extraordinarily diverse set of equipment industries depend on documents.
- Document Services: In large organizations, the life of documents in the work flows and processes of daily activity represent an enormous locus of value addition and cost reduction, which has led to a burgeoning industry in managed document services, ranging from specialized niches (such as payroll management by PayChex Inc.,) to managed office printing.
- Retail Production: From large chains such as Kinko's in the United States to small copy shops and offset print shops, documents support a large production sector for the end user.
- Publishing: All publishing, ranging from offset-based newspaper and magazine printing, to highly customized modern publishing using publish-on-demand digital print technology, is part of the document economy. The publishing industry includes major sub-areas such as the writer's market, small, medium and large publishing houses, small and large distributors and a vast network of independent and chain bookstores, online retailers, a large used-documents market and subscription-based markets.
- Document Transportation: The international postal system, as well as the commercial package transportation systems represented by companies such as DHL and UPS have economic models based largely on the demand for document transportation.
Future of documents
Since the advent of the digital era, documents have been rapidly evolving, and may require fundamental reconceptualization (Wesch, 2006). Efforts at this reconceptualization include Vannevar Bush's initial conceptualization of hypertext (Bush, V., 1945). The impact of digital technology can be understood in terms of several key aspects:
- Blurring the notion of document boundary: hypertext and Web content make it hard to determine what is being denoted by the term document. While the early days of the Web resulted in documents that mimicked their physical ancestors, Web content rapidly took on new characteristics. Reconceptualization of the notion of "boundary" is a key intellectual challenge (Sweet, 2002).
- Increasing structure and openness: The document is going from an opaque container of information to a much more open, structured document. XML is underlying most document formats today (OpenDocument or Office Open XML). In the future, it will become even more queriable, with the actual elements of this document being tagged — e.g. HR-XML.
- Dynamic nature: Web analogs of traditional paper documents like a newspaper column have taken on a dynamic character due to the impact of technology enabling the addition of comments from readers. The document will increasingly become "virtual", bringing up-to-date information from various sources in one container (a la "mash-up") - as such,it will be kept evergreen.
- Paper and electronic are reconciling: Paper has traditionally been a gap in document processing workflows. Technologies such as OCR, OMR, 2D Barcodes and Anoto pattern technology are helping get its content back into the electronic world. In the future however, Not only will that transition be seamless, but it will also be possible to track it while in the "physical" world through RFID and MemorySpot.
- Hybrid automated/human authorship: authorship workflows for digital documents have evolved to include the computer in a key role. Dynamic Web pages may be viewed as the joint output of a human author (who produces a template) and a software system (that fills in the template). Sophisticated examples of this phenomenon can be found in recent evolutions in paper documents as well. Variable data technology, for instance, allows creators of direct mail marketing documents to vary the content of every piece in a print run using technologies such as DesignMerge or Xmpie.
- Prosumer workflows: Content repositories such as Wikipedia radically alter traditional document production workflows by blurring roles such as author and editor.
- Customizability: Digital technology allows users to actively participate in the construction of documents they see, realizing the postmodern notion of construction of meaning in an unexpectedly literal way.
- Long Tail Economics: Technologies such as blogs have allowed document production economics to operate with such radically cheap cost structures that single individuals can derive an income from a global audience with low capital expenses. This has led to an explosion of niche content.
- Blurring of Documents and Interfaces: Technologies such as Ajax or Apollo blur the distinction between documents and user interfaces to "intelligent" technologies, leading to a whole class of smart documents that can go beyond the passive nature of traditional documents.
- Fluidity and Dynamic Microstructure: Distinct from the impact of hypertext on the notion of document is the fluid potential of modern documents at the microlevel, which allows an enormous variety of word and sentence level dynamic phenonomenology (Kelly, K., 2006).
See also
| Look up document in Wiktionary, the free dictionary. |
- Book
- Computer file
- Copier
- Document Automation/Assembly
- Documentality
- Hypertext
- Illuminated manuscript
- Lithography
- Seal (device)
- Typewriter
References
- Sellen, A. J. and Harper, R. H. R., 2001, The Myth of the Paperless Office
- McLuhan, M., 1969, The Gutenberg Galaxy
- McLuhan, M., 1964, Understanding Media: The Extensions of Man
- Faculty of Information Systems and Technologies
- Landow, G. P., 2006, Hypertext 3.0: Critical Theory and New Media in an Era of Globalization
- Bush, V., 1945, As We May Think, Atlantic Monthly, http://www.theatlantic.com/doc/194507/bush
- Kelly, K. 2006, Scan This Book!, New York Times Magazine, http://www.kk.org/writings/scan_this_book.php
- Owen, D., 2004, Copies in Seconds: How a Lone Inventor and an Unknown Company Created the Biggest Communication Breakthrough Since Gutenberg — Chester Carlson and the Birth of the Xerox Machine
- Searle, J. R., 1997, The Construction of Social Reality
- Anderson, B., 2006, Imagined Communities: Reflections on the Origin and Spread of Nationalism, New Edition
- Levy, D., 2003, Scrolling Forward: Making Sense of Documents in the Digital Age
- Gladwell, M., 2002, The Social Life of Paper, New Yorker Magazine, http://www.gladwell.com/2002/2002_03_25_a_paper.htm
- Lewis, D. K., 2002 Convention: A Philosophical Study (Revised edition)
- Pedauque, R. T., Document: Form, Sign and Medium, as Reformulated for Electronic Documents [1]
- Romano, F., 1989, Pocket Guide to Digital Prepress
- Sweet, J., 2003, Document Boundaries Master's Thesis, Rochester Institute of Technology
- Wesch, M., 2006, The Machine is Us/ing Us, video short documentary, http://www.youtube.com/watch?v=6gmP4nk0EOE
Categories: Documents | Information science
|
Thu, 22 Jul 2010 23:06:26 GMT+00:00
Orlando Sentinel Perry appointed a special magistrate to supervise the document inspection. And he allowed both parties to take notes during the inspection, a change from ...
758px x 600px | 64.40kB
[source page]
things that come with it So I have had to spend an hour inserting HTML code to format my document again I am almost done but I want to insert borders so that it looks something like this http www jeff barr com wp content d document png I just want the borders to provide a page like that I already have the text set up and everything but I need a border for it so that
unknown
hu, 29 Jul 2010 00:17:03 GM
A few weeks ago, as part of our discussion on the arrest of Bradley Manning for passing classified . documents. to Wikileaks, we asked where to draw the line between information and criminals secrets military. ...
Q. I am a college student and we are working on opening a excel document that is password protected. Just the particular document is protected, not the whole excel program. If there is anyone that could help or point me in the right direction that would be great. Thanks!
Asked by Jaws42 - Thu Apr 15 10:55:08 2010 - - 2 Answers - 0 Comments
A. Start by goggling "Excel password cracker".
Answered by www.Aeternus.sg - Fri Apr 16 23:05:45 2010


