Menu
Is free
registration
home  /  Firmware/ Application programs for processing text information. Moscow State University of Printing Arts

Application programs for processing text information. Moscow State University of Printing Arts

The use of computers has fundamentally changed the technology of writing and publishing. The desire to simplify the work with different texts led to the creation of applied software focused on solving these problems. There are two main groups of programs for preparing text documents: text editors and word processors.

Text editors are programs that create text files without formatting elements. Editors of this kind are indispensable when creating texts for computer programs, they are clear and easy to use. The whole variety of modern text editors can be conditionally divided into three main groups:

The first includes the simplest text editors that have a minimum of capabilities and are able to work with documents in the plain text format .txt, which, as you know, with all its simplicity and universal support, does not allow you to format text more or less decently. This group of editors includes the WordPad editors and the very little-functional NotePad, as well as many similar products from other manufacturers (Atlantis, EditPad, Aditor Pro, Gedit, etc.).

The intermediate class of text editors includes fairly broad opportunities in terms of document design. They work with all standard text files (TXT, RTF, DOC). These programs include Microsoft Works, Lexicon.

The third group includes powerful word processors such as Microsoft Word or StarOffice Writer. They perform almost all operations with text. Most users use these editors in their daily work.

The main capabilities of word processors include the following operations, which form the basis of technology for working with texts:

    document creation;

    editing a document (moving through the text; inserting and replacing symbols; deleting, moving, copying, finding and replacing text fragments; canceling commands; exchanging fragments between different documents);

    preservation of documents in external memory(on disks) and reading from external memory to RAM;

    formatting documents, i.e. performing transformations that change the appearance of the document (design of individual characters, paragraphs, document pages as a whole - changing the line length, line spacing, text alignment, changing the type and size of the font, etc.);

    printing of documents;

    automatic compilation of tables of contents and indexes in the document;

    creating and formatting tables;

    introduction of drawings, formulas, etc into the document;

    check punctuation and spelling.

The modern object-oriented approach makes it possible to implement mechanisms for embedding and embedding objects - OLE-technologies (Object Linking and Embedding). This technology allows you to copy and paste objects from one application to another. For example, in a text document created in MS Word, you can embed images, animation, sound and even video fragments and thus get a multimedia document from a regular document.

Editorial and publishing systems. This class of programs is intended for typing, design and full preparation for the typographic publication of books and magazines. Examples of such systems are: Microsoft Publisher, Corel Ventura, Adobe PageMaker, Quark XPress. Publishing systems are indispensable for computer layout and graphics, greatly facilitate the work with multi-page documents, because they provide the ability to automatically paginate text, arrange page numbers, create headings, etc. Creation of layouts for any publication using such systems is greatly facilitated.

Application packages

· PPP are the most dynamically developing part of the software: the range of tasks solved with the help of PPP is constantly expanding. In many ways, the introduction of computers in almost all areas of activity has become possible due to the emergence of new and improvement of existing PPP.

· Advances in the field of microelectronics, leading to the emergence of more powerful in their functionality, are also the reason for the creation of new RFPs. In turn, the need to improve the characteristics of using the package when solving specific user problems stimulates the improvement of the architecture and element base of computers and peripheral devices.

· The structure and principles of constructing the PPP depend on the class of computers and the operating system within which this package will operate. The largest number of various PPPs have been created for IBM PC-compatible computers with the MS DOS operating system and the WINDOWS operating shell.

RFP classification.

1. Problem-oriented RFP is the most developed in terms of implemented functions and numerous in terms of the number of created RFP packages.

2. Word processors - specialized programs designed to work with documents (texts), allowing you to compose, format, edit texts when the user creates a document.

3. Desktop publishing systems (NIS) - programs designed for professional publishing and allowing electronic layout of a wide range of basic types of documents, such as a newsletter, a short color brochure and a voluminous catalog or trade application, a directory (Page Maker, Quark XPress, Frame Maker , Microsoft Publisher, etc.).

4. Graphic editors - packages designed for processing graphic information are divided into PPP for processing raster graphics and images (Adobe Photoshop), and vector graphics(intended for professional work related to artistic and technical illustrations with subsequent color printing, occupy an intermediate position between CAD and NIS (example: Corel Draw).



5. Spreadsheets are software packages designed to process tabularly organized data.

6. Work organizers are software packages designed to automate planning procedures, the use of various resources, both for an individual and for the entire company or its structural divisions. Subdivided into: project management (MS Project) and the organization of the activities of an individual (Lotus Organizer ACTI).

The main components of the application package

Since the appearance of the first computers, many applied developments have appeared, but, despite the diversity, their generalized internal structure can be represented in the form of three interrelated elements (Fig. 1):

  1. input language (macro language, control language) - represents a means of communication between the user and the package;
  2. subject support (functional content) - implements the features of a specific subject area;
  3. system software (system content) - represents low-level tools, for example, access to operating system functions.

Subject support reflects the features of the class of problems being solved from a specific subject area and includes:

· Software modules that implement algorithms (or their separate fragments) of applied tasks;

· Tools for building programs from separate modules.

Currently, the most common design of software modules in the form of libraries that can be connected statically or dynamically. Depending on the approach used by the developers to design and implement PPP, such libraries contain built-in classes and descriptions of their interfaces (when using object-oriented programming). When using the paradigm structured programming library modules contain procedures and functions designed to solve some independent tasks... In both cases, libraries are linked to other modules in the package with only input and output information.

Applied software systems for processing text and graphic information.

Text processing facilities

The whole variety of modern text editors can be conditionally divided into three main groups:

1. The first includes the simplest text editors with a minimum of capabilities and capable of working with documents in the plain text format.txt, which, as you know, with all its simplicity and universal support, does not allow more or less decent formatting of text. This group of editors includes the WordPad editors and the very little-functional NotePad, as well as many similar products from other manufacturers (Atlantis, EditPad, Aditor Pro, Gedit, etc.).

2. The intermediate class of text editors includes quite wide possibilities in terms of document execution. They work with all standard text files (TXT, RTF, DOC). These programs include Microsoft Works, Lexicon.

3. The third group includes powerful word processors such as Microsoft Word or StarOffice Writer. They perform almost all operations with text. Most users use these editors in their daily work.

Today, almost all powerful text editors are part of the integrated software packages designed for the needs of the modern office. For example, Microsoft Word is included in the most popular office suite Microsoft Office.

Similar MS Office programs- OpenOffice.org Writer, StarOffice Writer, Corel WordPerfect, Apple Pages.

There are many classes of software for working with computer graphics. There are usually two main types of computer graphics, which differ in the principles of image formation: raster and vector.

Graphic information processing tools

The simplest bitmap editors include PaintBrush, Paint, Painter, which allow you to directly draw the simplest bitmap images.

The main class of raster graphic editors is intended for processing ready-made bitmaps in order to improve their quality and create your own images from the existing ones. Such editors include such powerful programs like Adobe Photoshop, Corel PhotoPaint, Gimp and others.

As in the case of raster graphics, for working with vector graphics there is great amount software, the development of which is more difficult in comparison with raster. The main programs include:

CorelDraw is a professional graphics editor with rich settings and an advanced control system.

Adobe Illustrator - the main advantage of the program is that it, together with Adobe Photoshop and Adobe PageMaker, forms a powerful enough package for the layout of printed publications and the development of complex documents.

Macromedia Freehand is one of the most user-friendly and intuitive vector editors out there. The program is distinguished by its simplicity of control system and high performance, but its capabilities are somewhat more modest than those of previous editors.

STATE EDUCATIONAL INSTITUTION
HIGHER PROFESSIONAL EDUCATION
DON STATE TECHNICAL UNIVERSITY
    (DSTU)
COURSE WORK
by discipline informatics
option number 6

APPLIED SOFTWARE. CLASSIFICATION OF PPO. TEXT PROCESSING PROGRAMS

Rostov-on-Don
2010

Content

Introduction 3
1 Application software and its classification 5
1.1 Application software 5
1.2 Application software classification 5
1.2.1 General Purpose Software 6
1.2.2 Instrumental software special
destination 9
1.2.3 Professional grade software 10
2 Text-processing programs 12
2.1 Plain text editors 14
2.1.1 NotePad 15
2.1.2 KEdit 16
2.1.3 KWrite Editor 17
2.1.4 McEdit Editor 18
2.1.5 The Emacs Editor 19
2.2 Text editors with markup elements 25
2.2.1 Text editor MS Word 25
2.2.2 StarWriter Editor 31
2.2.3 Netscape Composer 35
2.2.4 KLyX Editor 38
Conclusion 40
References 41

INTRODUCTION

A personal computer is known to be a universal device for information processing. Personal computers can perform any actions to process information. To do this, it is necessary to compose an accurate and detailed sequence of instructions for the computer in a language it understands - a program on how to process information. By changing programs for your computer, you can turn it into workplace accountant or constructor, designer or scientist, writer or agronomist.
In addition, the trend of lowering the cost of computer equipment while increasing its productivity has led to the fact that computers are becoming a household item, such as a TV or refrigerator, which expands the scope of the PC even more. Accordingly, more and more varied software is required to meet the challenges of new PC applications. The continuous increase in the power of personal computers, peripheral devices, as well as the development of communication facilities give software developers more and more opportunities to fully satisfy the needs of end users. This is a graphical interface that has become a standard for any software, and the implemented capabilities for sending documents and data using the Internet, directly from an application program (Microsoft Word, Excel, Access, etc.), and the ability to use a computer as a repository of information due to the emergence of new types high-capacity drives and short data access times, as well as many other features and service functions.
During their execution, programs can use various devices for input and output of data, just like the human brain uses the senses to receive and transmit information.
By itself, the PC does not have knowledge in any area of ​​its application; all this knowledge is concentrated in the programs executed on it. Therefore, the often used expression "the computer has done" means exactly the fact that a program was executed on the PC that made it possible to perform the corresponding action. Currently, the entire software package is divided into system and user programs. The system software acts as an "organizer" of all parts of the PC, as well as external devices connected to it. User programs serve to perform any specific tasks in all spheres of human activity.
The number of programs installed on a modern computer is in the hundreds and even thousands. They provide a comfortable user experience.
The entire set of programs is included in the so-called computer software. The composition of the PC software is its most important functional characteristic. Software (SW) is a collection of programs of regular use necessary to solve user problems, and programs that allow the most efficient use of computing technology, providing users with the greatest convenience in work and minimum labor costs for programming tasks and information processing. Therefore, the topic of the work is relevant.

1. Application software and its classification

1.1. Application software

Application software (application software, application programs) - programs designed to perform specific user tasks and designed for direct interaction with the user. Unlike applied,system software (operating system) is used to ensure the operation of the computer itself and the execution of application programs.

Application software - software consisting of:
- individual application programs and software packages designed to solve various user tasks;
- automated systems created on the basis of these (packages) application programs.
Application packages (PPP) - serve as software tools for solving functional problems and are the most numerous class of software products. This class includes software products performing information processing in various subject areas.
Installation of software products on a computer is performed by qualified users, and their direct operation is carried out, as a rule, by end users - consumers of information, in many cases whose activities are very far from the computer field. This class of software products can be very specific for certain subject areas.
Application programs are designed to provide the use of computing technology in various fields of human activity. In addition to creating new software products, developers of application programs spend a lot of effort on improving and modernizing popular systems, creating new versions of them.

1.2 Application software classification

Application programs are designed to provide the use of computing technology in various fields of human activity. In addition to creating new software products, developers of application programs spend a lot of effort on improving and modernizing popular systems, creating new versions of them. Newer versions tend to support old ones while maintaining consistency and include a basic minimum (standard) of capabilities.
One of possible options the classification of software tools (PS) that make up the application software (APP) is shown in Fig. 1. Like almost any classification, the one shown in the figure is not the only possible one. Not even all types of application programs are presented in it. Nevertheless, the use of classification is useful for creating a general understanding of the application software.

Rice. 1. Classification of application software

        General purpose software
Despite the broad possibilities of using computers to process a wide variety of information, the most popular are programs designed to work with texts - text editors and publishing systems. Text editors are called programs for entering, processing, storing and printing text information in a user-friendly form. Experts estimate the use of a computer as a typewriter at 80%.
Graphical information processing programs have become very popular. Computer graphics are currently one of the fastest growing areas of software. It includes the input, processing and output of graphic information - drawings, drawings, pictures, texts, etc. - by means of computer technology. Various types of graphic systems allow you to quickly build images, enter illustrations using a scanner or video camera, and create animation clips.
Graphic editor allow you to use various artist's tools, standard image libraries, sets of standard fonts, editing images, copying and moving fragments across screen pages, etc. There are special programs for performing calculations and further processing of numerical information - spreadsheets. In the course of the activity of any specialist, it is often required to present the results of work in the form of tables, where one part of the fields is occupied by the initial data, and the other - by the results of calculations and graphical analysis. They are characterized by a large amount of processed information, the need for multiple calculations when changing the initial data. Automation of such routine work is done by spreadsheets.
One of the most promising directions in the development of computer technology is the creation of special hardware for storing gigantic arrays of information data, and subsequent non-numerical processing of their search and sorting. For computer processing of such databases, database management systems are used. DBMS is a set of software tools needed to create, process, and display database records. There are several types of DBMS: hierarchical, network, relational. When working with a DBMS, there are several sequential stages:
database design;

Database structure creation;

Populating the database;

Viewing and editing the database;

Sorting the database;

Search for the required entry;

Selection of information;

Creation of reports.
Typically, most popular database management systems support these steps and provide convenient tools for their implementation.
Computer publishing systems is a complex of hardware and software designed for computer typing, layout and publication of text and illustrative materials. The main difference between desktop publishing systems and text editors is that they are designed primarily for document preparation, and not for input and editing. The layout process consists in the design of the text and setting the conditions for the relative position of the text and illustrations. The purpose of the layout is to create an original layout suitable for printing a document.
There are various publishing systems, among which the following can be distinguished:
1. Adobe InDesign - a recently appeared package from Adobe, optimized for layout of documents of the widest profile, from one-page booklets to thick books, enriched with a set of specific visual tools.
2. Adobe PageMarker - another package from Adobe, with a rather complex interface and command system, but at the same time with high performance and rich capabilities, especially when working with color.
3. Corel Ventura Publisher - an alternative package from Corel, which has recently lost its position, but due to its versatility (has wide functions of conventional text and graphic editors, integration with the Web, support for various platforms) has not lost its relevance.
4. QuarkXPress is a fairly easy-to-learn and flexible publishing system, which is traditionally used by many publishing houses of newspapers, magazines, and advertising agencies.
The desire to combine the functions of various applications into a single system has led to the creation of integrated systems. Versatile integrated systems were developed on the principle of a single system containing text and graphic editors, spreadsheets and a database management system as elements. Examples: Framework, Works, Master. The modern concept of software integration is the cooperation of individual application software systems similar to the well-known MicroSoft Office package. The systems themselves included in the package are independent, moreover, they themselves represent a locally integrated package, since, in addition to their main task, they support the functions of other systems. For example, text editor Word has the ability to manipulate spreadsheets and databases, and the Excel spreadsheet has a powerful text editor built into it. To interface information data from various software systems, they provide for an import-export exchange system with recoding of data presentation formats.

        Special purpose software tools
Developers create special targeted software systems for specialists in a certain subject area. Such programs are called authoring tool systems. The authoring system represents an integrated environment with a given interface shell, which the user can fill with the information content of his subject area.
Expert system is a program that behaves like an expert in some narrow application area. Expert systems are designed to solve problems with uncertainty and incomplete initial data, requiring expert knowledge for their solution.
In addition, these systems must be able to explain their behavior and their decision.
The fundamental difference between expert systems and other programs is their adaptability, i.e. variability in the process of self-learning.
It is customary to distinguish three main modules in expert systems:

Knowledge base module;

Logic output module;

User interface.
Expert systems, which are the basis of artificial intelligence, have become widespread in science (classification of animals and plants by species, chemical analysis), in medicine (diagnosis, analysis of electrocardiograms, determination of treatment methods), in technology (troubleshooting in technical devices, tracking flight of spaceships and satellites), in political science and sociology, forensic science, linguistics, etc.
Recently, programs for processing hypertext information have gained wide popularity. Hypertext- this is a form of organizing text material not in a linear sequence, but in the form of indicating possible transitions (links), links between its individual fragments. Plain text uses the usual linear principle of information placement and is accessed sequentially. In hypertext systems, information resembles the text of an encyclopedia, and access to any selected text fragment is carried out arbitrarily by reference. The organization of information in hypertext form is used to create reference manuals, dictionaries, contextual help (Help) in application programs.
Extension of the concept of hypertext to graphic and sound information leads to the concept of hypermedia. The ideas of hypermedia have spread in network technologies, in particular in Internet technologies. The WWW (World Wide Web) technology has made it possible to structure the vast world information resources through hypertext links. Software tools have appeared that allow you to create such Web-pages. Mechanisms for finding the necessary information in the maze began to develop information flows... Popular search engines on the Internet are Yahoo, AltaVista, Magellan, Rambler, etc.
Multimedia(multimedia) is the interaction of visual and audio effects under the control of interactive software. The emergence and widespread distribution of compact disks (CD-ROMs) made the use of multimedia effective in advertising and information services, network telecommunication technologies, and training.
Multimedia gaming and learning systems are beginning to replace traditional “paper libraries”. Today, in CD-ROM libraries you can “walk” through museums, the Moscow Kremlin, and so on. using the “electronic guide”.

        Professional grade software
Each application of this group focuses on a fairly narrow subject area, but penetrates into it as deeply as possible.
AWP- a workstation of a specialist in a subject area, equipped with a computer and special software that helps to solve problems within the framework of the activity of this specialist (for example, an accountant's workstation, a dispatcher's workstation, a technologist's workstation, etc.).
The workstation can be equipped with additional non-standard equipment, for example, the workstation of telemechanics is equipped with a programmer for ROM chips, a debugging kit, etc.
Often, the term AWP is understood exclusively as a software product for automating the work of a specialist. This assumes that a computer is already available at the workplace.
CAD- Automation system for design work, or system computer-aided design, (eng. CAD, Computer-Aided Design) - a software package designed for the design (development) of production (or construction) facilities, as well as the design and / or technological documentation.
Modern CAD systems are used in conjunction with CAE (Computer-aided engineering) automation systems for engineering calculations and analysis. Some CAD systems contain integrated automation tools for engineering calculations and analysis.
Data from CAD systems are transferred to CAM (Computer-aided manufacturing - a system for the automated development of part processing programs for CNC machines or GAPS (Flexible Automated Manufacturing Systems)).
Working with CAD usually implies the creation of a geometric model of the product (two-dimensional or three-dimensional, solid), the generation of design documentation based on this model (product drawings, specifications, etc.) and its subsequent maintenance.
It should be noted that the Russian term "CAD" in relation to industrial systems has a broader interpretation than CAD - it includes CAD, CAM and CAE.
ASNI- The Automated System of Scientific Research (less often the terms SANI are used - the system of automation of scientific research and the SAE - the system of automation of the experiment) is a software and hardware complex based on computer technology, designed for scientific research or complex testing of new technology samples based on the receipt and use models of the investigated objects, phenomena and processes.
ACS- Automated Control System - a set of mathematical methods, technical means (computers, communication devices, information display devices, etc.) and organizational complexes that provide rational management of a complex object (process) in accordance with a given goal. ACS consists of a base and a functional part. The basis includes information, technical and mathematical support. The functional part includes a set of interrelated programs that automate specific management functions (planning, financial and accounting activities, etc.). Distinguish between ACS objects (technological processes - automated process control system, enterprise - automated control system, industry - automated control system) and functional automated systems, for example, design, calculations, logistics and other support.
ACS- Automated enterprise management system - a complex of software, technical, informational, linguistic, organizational and technological means and actions of qualified personnel, designed to solve the problems of planning and managing various types of enterprise activities
Pedagogical complexes- packages of applied programs for use in the learning process in various subjects

Finally, we emphasize once again not only the conventionality of the classification proposed above, but also the presence of intersections. So, each specific expert system can be easily attributed to professional-level software; the principle of hypertext is implemented in a number of authoring systems, etc.

2 Text-processing programs

One of the most common functions of a modern personal computer is the preparation of a variety of text documents. This section deals with software products operating in MS Windows and Linux OS and designed to work with texts. We will get acquainted with the principles of creating and editing both simple and more complex documents.
There are two main groups of programs for preparing text documents: text editors and word processors.
Text editors are basically programs that create text files without formatting elements (that is, they do not allow you to select parts of the text with different fonts and typefaces). Editors of this kind are indispensable when creating texts for computer programs.
Word processors are able to format text, insert graphics and other objects into a document that are not related to the classic concept of "text". It should be noted that this division is arbitrary - the variety of programs for word processing allows you to find an editor with any set of functions.
Some word processors are so-called WYSIWYG editors. The name is derived from the first letters of the phrase What You See Is What You Get - what you see is what you get. When they say that this is a WYSIWYG editor, they guarantee that the appearance of the document on the computer screen and its printed copy is fully consistent. These types of editors include Word and StarWriter.
Some modern editors support the "almost" WYSIWYG concept. At the same time, the appearance of the document on the screen is slightly different from how the printed document will look, but this is done on purpose in order to more efficiently use the working window of the document. Examples of "almost" WYSIWYG editors are Netscape Composer and KLyX.
Many text files are sent as plain text. Plain text is difficult to make attractive and easy to read, since it does not have various typefaces, graphics, headings, subheadings, etc. These additional features are called markup.
Speaking about text markup, the concepts of physical and logical markup are distinguished. When using physical text markup, the exact appearance of each fragment is indicated. For example, "centered text, 14 point, bold, Times typeface." With logical markup, the logical meaning of this fragment is indicated, for example, "this is the chapter title". These two markup methods are generally intended to be used in different situations. In order to print text on a printer, you need to use physical markup. Decisions have to be made about the size of the margins and paragraph indents. Early versions of word processors used only physical markup. At the same time, the font, size and style were indicated for each fragment.
When communicating with other people, the physical layout of the text imposes a number of restrictions, especially for online documents. Screen size, resolution, fonts are different for different systems. For these reasons, more and more logical text is used. In some cases, logical design is practically necessary: ​​when creating electronic documents such as WWW pages or when creating and publishing voluminous works such as books.
To preserve the markup of documents when transferring text information from machine to machine, use different ways... Word processors and publishing systems use specially designed file formats that contain not only text, but also information about how it should be formatted. The main problem here is the incompatibility of such formats, although the most complex programs can usually read files in the formats of competing programs. Examples of this approach are Word and StarWriter word processors.
In another approach, special markup commands are inserted directly into the text of the document. Even if you don't have software that supports this format, you can still figure it out. There are many ways to represent text markup in this way, including:
HyperText Markup Language (HTML) used on the World Wide Web;
TeX and LaTeX, popular with many academic publications as well as mathematicians, physicists, chemists and even musicians.
Examples of programs that can mark up text in this way are Netscape Composer and LyX (KLyX).
Files created by different editors often have unique extensions that make it possible, without looking inside the document, to guess about how to mark up text. For example, files created by editors for preparing plain-text often have the extension .txt, and those prepared in Lyx - .lyx. The Word word processor creates files in MS Word format by default (extension .doc), but it also supports other formats, such as RTF (extension .rtf). Documents containing HTML markup commands have the extension .html or .htm.
It is obviously impossible to list all text editors. Many of them are "sharpened" for one or another specific activity. The list below shows only a small selection of text editors.

Plain text editors

NotePad - built into the Windows operating system, easy to understand and use;
McEdit - resembles MS DOS Edit, component file manager mc (Midnight Commander) Linux OS;
KEdit is the simplest text editor included with KDE Linux;
KWrite is a text editor that has a number of additional settings compared to other simple text editors;
Emacs - combines the functions of a file manager and a text editor; one of distinctive features is the ability to create macros (macros); is available in all clones of Unix, including Linux; Emacs can be used on MS Windows too.

Editors who create text with markup elements

Word - serves to create a variety of printed documents, is a component of office applications in MS Windows;
StarWriter - included in the StarOffice program, in appearance and functionality it is similar to Word, works equally well in both MS Windows and Linux;
LyX (KLyX in KDE) is a modern text editor designed for people who want to get a document that looks professional, but spend a minimum of time creating it; the editor inserts TeX and LaTeX markup commands into the text;
Netscape Composer - inserts HTML markup commands into the text; there are versions for both Linux and MS Windows.

      Unformatted text editors
Simple and easy-to-use plain-text editors are constantly required by programmers when writing program texts, creators of Web-documents who are not satisfied with Web-editors for a number of reasons, people using TeX or LaTeX, and many others. The principles of creating, saving and editing documents in them are very similar and do not require painstaking work from the user to master them. We will get acquainted with the NotePad program included in MS Windows, as well as with the editors included in the KDE (Linux) environment: KEdit, KWrite, McEdit. Then we'll look at Emacs, for which plain text creation is only a fraction of its potential.

2.1.1 NotePad

NotePad editor is one of the most common tools for preparing unformatted text in the MS Windows environment. The editor window contains a menu bar with three items: File, Edit, and Search.

The File menu is used to work with files. It allows you to create, save and open an existing file. Document editing commands (copying, pasting, deleting a piece of text) are carried out using the Edit menu. To search for a piece of text, use the Search menu.
NotePad allows another way to invoke document editing commands - the context menu. To call the context menu, select a certain element of the document, and then right-click. To close the context menu without selecting a command, just click on the mouse button, placing its cursor outside the menu, or press the Esc key.
Various key combinations are used to highlight parts of the text and move, presented in the table. Shift + Right Arrow Select the character to the left of the cursor

Shift + left arrow
Shift + up arrow
Shift + down arrow
Shift + Home

Shift + End

PgUp / PgDn
Home / End
Ctrl + right arrow
Ctrl + left arrow
Ctrl + Home
Ctrl + End

Select a character to the right of the cursor Select the previous line
Select the next line
Select text from the beginning of the line to the cursor
Select text from cursor to end of line
Move one page up / down
Move to start / end of line
Move a word forward
Move one word back
Move to the beginning of text
Move to the end of the text

To set the cursor to a specific place, you can use the mouse: you need to move the cursor and left-click.
Before executing the commands for editing a fragment of text, you need to select it, for which you can use two methods: using the mouse, or using a key combination. In order to select text with the mouse, you need to position the mouse cursor at the beginning of the text fragment to be selected, and move the cursor while holding down the left mouse button to the end of the fragment. The slice remains selected until another slice is selected.
NotePad editor uses the so-called clipboard in its work. Fragments of text of arbitrary length are temporarily placed on the clipboard. A fragment from the previous position is copied or deleted, while it is automatically placed on the clipboard and then can be pasted anywhere in the text. The fragment is stored in the buffer until new text is inserted into it.

2.1.2 KEdit Editor

The KEdit editor is started by typing "kedit" on the command line, or from the KDE main menu. KEdit is an easy-to-use, fast and easy-to-use editor.

Basic typing techniques are similar to working with other basic text editors. All manipulations are carried out using the menu and toolbar. The toolbar contains buttons for file operations (create, open, write), edit (copy, paste, cut), print and send mail. It is a plain text editor and therefore contains no formatting tools.

The Settings menu allows you to change the appearance of the editor by installing fonts of different typefaces and styles. You can override the font and background color, spell check options, the number of characters per line, etc. You can choose the encoding in which the document will be saved.

2.1.3 KWrite Editor

Another editor designed for creating unformatted texts is KWrite. To run it, you need to type "kwrite" on the command line or use the KDE menu.
The KWrite editor is useful for typing text containing markup commands and program codes, since it is a syntax-oriented editor. The latter means that he is able to highlight special constructions and marking commands with color.
From an interface point of view, KWrite is similar to KEdit and has a menu bar and a button bar. In the File item, in addition to the standard items similar to those in the KEdit editor, the New View item has been added (opening a window containing a copy of the current document). In the Go menu item, the ability to insert so-called labels has been added, which accelerates the search for the desired text fragment. The Edit item contains standard editing commands, as well as the ability to insert the contents of the specified file into the text. KWrite maintains a so-called rollback log, which allows you to undo a number of actions taken.

In the Tools item, you can set / remove indents and add comments corresponding to the type of file extension.
The settings in KWrite are noticeably richer than in KEdit. In it, you can set the coloring of the syntax of any language: C, C ++, Java, HTML, etc., typeface, size, encoding and font color. In the KWrite Settings sub-item, you can set the line length limit (when word wrap is enabled), tab stop, undo levels, various indents, and selection options (including multiple and vertical selection). Here you can also define hotkeys for a large number of commands - cursor movement, file operations, editing, selection, search, replace and transition.

2.1.4 McEdit Editor

Users of the file manager Midnight Commander willingly use the built-in editor Mcedit. However, it can run as a standalone application, in which case you need to type "mcedit" in the command window.
Outwardly, this editor resembles the Edit editor from the set of utilities Norton Commander, well known to users since the days of MS DOS.
Moving around the document is carried out using the cursor keys, as well as using the mouse. To move to another line, press the Enter key. The selection of a fragment of text is carried out with the help of the mouse.
The table lists the assignment of menu items and some key combinations for working with text.

    Ctrl + y Ctrl + u
    Ins
    F7
    Shift + F7
    F4
    F3
    F5
    F6
    F8
    Ctrl + f
    Shift + F5
    Shift + F4
    F2
    F10
    Deleting a line Undoing the last editing operation
    Insert / overtype switch
    Search (regular expressions can be used)
    Redo the last search operation
    Replacement
    The first press is the beginning of the block mark, the second is the end
    Copy selection
    Move selection
    Deleting the current line
    Write to file
    Reading from a file
    Creating a new file
    Saving a file
    Completion of work

2.1.5 The Emacs Editor

All Unix systems, including Linux, use the Emacs editor extensively. As a plain text editor, it has many more advanced features and capabilities. Emacs is a handy file manager that makes it easy to manipulate files and directories.
When you start Emacs, it creates its own window, which, with the exception of the first and last lines, is reserved for displaying text. The first line is called the menu bar, and the last line is called the echo area or minibuffer window, where the command sequences entered are displayed and prompts appear. As soon as you stop for more than a second in the middle of a command, all signs for that command are immediately displayed. In the future, we will stick to the notation similar to those that appear in the echo area, for example, when you simultaneously press the Ctrl and x keys, the line Cx will appear in the minibuffer, and if you press the Alt and x keys, the line M-x (the keyboard key marked Alt in Emacs plays a special role: it performs the functions special key- META).
The echo area is also used to display the minibuffer: a window that is used to read arguments to commands, such as the name of the file to edit. When the minibuffer is used, the echo area begins with a hint line, which usually ends with a colon; in addition, a cursor appears on this line as it becomes the selected window. You can always exit the minibuffer by typing C-g.
The recommended way to use Emacs is to run it only once, right after you login. If you need to work with several files or different parts of the same document at the same time, you should use the C-x 5 2 command or use the Make New Frame item of the Files menu to create a new window (the so-called frame). It is allowed to split one window into several working areas (in Emacs terminology - windows), although this is less convenient than using frames. Command C-x 2 will split the window vertically in two, command C-x 3 - two horizontally, and the command C-x 1 will leave only one window.
Emacs stores the text of each file in an object called buffer. The first time a file is accessed, Emacs creates a new buffer. To see a list of buffers created in the current session, type C-x C-b.
To exit the Emacs editor, use the command C-x C-c. If at that moment there were buffers whose contents were not saved to disk, then you will be prompted to save them.
When you start the editor with the "emacs. &" Command, an Emacs window appears with information about the contents of the current directory (the & at the end of the command starts the application in the background). In this case, it plays the role of a file manager.
Moving through directories

The first two lines of the Emacs file manager window on the right contain a period (.) And two periods (..). The dotted line corresponds to the current directory. A line with two dots is used to navigate to the parent directory (that is, one level up in the file tree). If you want to enter any subdirectory of the current directory, then you need to place the cursor on the line with its name and press the Enter key.
To access the required file or directory, use the command C-x C-f. The Find File line will appear in the minibuffer window, which contains the path to the current directory. Editing it, specify the path to the directory you need in accordance with the syntax of the operating Linux systems... After entering the name of the directory or file, press the Enter key. If a file name is specified, the corresponding file will be opened or created.
Creating a directory
To create a directory, use the + command. The Create directory message will appear in the minibuffer, after which you need to enter the name of the directory to be created.
Copying a file or group of files
To copy a file, place the cursor on the file name and press the C key. After that, the Copy to line appears in the minibuffer window, in which you need to specify the path to the directory into which you want to copy the file.
If you want to copy a group of files, then before copying, you must first mark each file by pressing the m key, after which the mark * appears in the extreme left position of the line with the file name.
Renaming (moving) files and directories
To rename a file, place the cursor on this file and press the R key. A line with the name of the file (directory) appears in the minibuffer and the full path to it is indicated. To rename a file, just specify a new name, and to move - the path to a new directory.
Editing
To edit a file, you must instruct the editor to read it and prepare a buffer containing a copy of the file's text. This operation is called a file reference. Editing commands are applied directly to the text in the buffer, that is, to the copy inside Emacs. Your changes will appear in the file itself only after you save the buffer. As noted, the command C-x C-f is used to access the file. If you access a file that has already been opened, no new buffer is created.

You can start Emacs with a specific filename, for example "emacs file.txt &". In this case, the editor will create a buffer and load the contents of the specified file into it (if there is no file with this name, an empty buffer will be created), after which you can immediately start editing.
After making changes to the clipboard, you need to save its contents. Storing a buffer in Emacs means writing its contents back to the file with which the buffer is associated. To do this, use the command C-x C-s.
If the buffer contains changes that have not been saved to the file, then the buffer is said to have been changed (or modified). This means that changes will be lost if the buffer is not saved. Two asterisks in the status bar of the editor window indicate that the buffer has been changed.
Emacs from time to time (based on the count of the keys you pressed) writes out any modified buffers by itself. This operation is called autosave or autosave. Autosave creates a file whose name differs from the name of the original file by the presence of a "#" at the beginning and end of the name. For example, the buffer associated with the file exam.txt will be saved to the file # exam.txt #. When you save the buffer, Emacs deletes the autosaved file.
To create and edit a document you can use the menu bar at the top of the Emacs window. The menu bar contains the following items:
Buffers - serves to navigate through previously created buffers with files;
Files - this menu duplicates commands for working with files;
Tools - serves to perform many different actions; note that it contains the ability to work with multiple buffers;
Edit - when editing a document, you can use this menu, among the functions there is the ability to check spelling (Spell);
Search - this menu is needed to search for fragments of text in a document;
HTML (or TeX, Java, etc.) - appears if Emacs "recognizes" the file extension, in this case additional features are provided that are specific to files with this extension (for example, if it is a file with the html extension, then it becomes possible to insert the most common hypertext markup commands);
Help - help menu.
It is convenient to select and copy text using the mouse. The selected text fragment can be deleted by right-clicking on it, as well as inserted into any part of the document by pressing the middle mouse button or by executing the C-y or Shift + Insert command.
To switch the input from Russian to English text and vice versa, the capabilities provided by the operating system are used. In addition, Emacs is often configured to switch with the F5 key. Emacs allows you to use two modes of keyboard layout for Russian letters: typewriter layout ("ytsuken", as a rule, indicated on the keyboard) and phonetic layout (by consonance, the key with the English letter k also contains Russian k). Switching between them is carried out with the F6 key, or standard means switch keyboard.
Let's list some commands of the Emacs editor for working with text. If you make a mistake when entering a command, the C-g key combination will interrupt its input.

command Appointment
DEL C-d
C-k
M-d
M-DEL
C-y
C-a
C-e
M-f
M-b
C-n
C-p
M-l
M-u
M-c

C-x u
Esc (number command)

Delete character before cursor position Delete character after cursor position
Destroy everything to the end of the line
Destroy the word
Destroy the previous word
Recover the last destroyed text
Move cursor to start of line
Move cursor to end of line
Move cursor one word forward
Move cursor one word back
Move cursor one line down
Move cursor one line up
Convert word to lowercase
Convert word to uppercase
Convert word to lowercase with first capital letter
Undo the last command (undo)
Repeat the specified number of times the command

It is difficult to list all the features of Emacs in such a small tutorial. In addition to editing texts, it allows you to perform many other functions. Below are just a few of them that will allow you to take a break from work and relax.
First, let's note one convenient feature provided by the editor - it is auto-continuation when entering commands in the minibuffer line. After typing the beginning of a long sequence of characters, use the Tab key to display the possible continuation of the command.
Emacs can also act as a desk calendar. To enter this mode, type M-x calendar. The window will split into two, the lower one will show a three-month calendar centered on the current month with the current date highlighted. The calendar image automatically scrolls when you move out of the visible area. Some of the commands for working with the calendar are listed below.
If you get bored, you can try M-x hanoi. As the name of the team implies, this is an implementation of the famous "Tower of Hanoi" puzzle. If you are very bored, then give a numeric argument, for example, M-x hanoi 5.
Playing tic-tac-toe on an endless board will require your personal involvement. It starts up by command M-x gomoku; the winner is the one who manages to build a row of 5 elements.
M-x dunnet launches a great adventure game.
If you are upset, run the famous Eliza program (a sample program that simulates artificial intelligence). She acts as a psychotherapist who is ready to help you sort out your problems. Enter M-x doctor to start, and end each of your answers to the doctor by double-clicking on the Enter key. To end the conversation with the therapist, use the quit command.

2.2 Text editors with markup elements

2.2.1 Word processor MS Word

Microsoft Word is a powerful text editor that is widely used in Windows environment... It is a convenient tool for preparing a variety of letters, business documents, reports. With its help, it is convenient to create both forms and questionnaires, as well as articles, brochures.
At the heart of the design of documents in Word is a system of templates and formatting styles that allow you to achieve the uniformity of the design of most documents. Word belongs to WYSIWYG editors: the printed document looks the same as on the screen.

Word saves text files in native MS Word binary format by default (the corresponding file extension is .doc). The text version of this format is RTF format (Rich Text Format), documented by Microsoft and supported by word processors of some other companies. Due to its textual structure, RTF is much safer in terms of spreading computer viruses, while DOC files are one of the tools for transferring viruses between computers. Recent versions of the processor can save the file in a format that includes hypertext markup elements (.html or .htm).
etc.................

Data processing. Database management systems.

Lecture 11. Application software,

Application software (APO) constitutes end-user programs. This is the most extensive class of software.

PPO, classified by purpose, can be divided into general-purpose application programs and special purpose applications... (fig.11.1). This classification is rather arbitrary.


General purpose application software.

General purpose software is used to solve the most general information problems in any area of ​​human activity. It integrates widely used programs by most PC users, for example, word processing, spreadsheets, graphics systems, games, entertainment.

Text processing programs. These include text editors, word processors. The border between them is rather arbitrary. Text editors (Notepad) are capable of performing basic editing functions - typing, making corrections, saving, working with fragments. Word processors (Word), in addition, have a variety of design options - document formatting.

Spreadsheets. The main purpose of spreadsheets is to process various types of data presented in tabular form (Excel, Lotus 1-2-3). The main advantage here is the automatic change of some cells, depending on the change in the content of others.

Database management systems (DBMS). Programs of this class (Access) allow you to work with large amounts of structured data - databases (as a rule, these are tabular structures). The functions of the DBMS include the description of data, access to data, search, selection of data according to certain criteria. Most modern DBMSs allow you to create small data processing programs in built-in languages, have design capabilities that allow you to create reports and queries based on the data. Many DBMSs, as well as word processors, have their representatives in general and special purpose software. At the general-purpose application software level, these are desktop DBMS, at the special level, these are large DBMS that form the basis of information systems and allow working in computer networks.

Graphics systems. These are programs designed to work with graphic images. These include raster and vector graphics editors, 3D graphics processing programs.

Integrated software tools. Individual programs, being a powerful tool for solving a range of applied problems, cannot fully satisfy the user. Software developers very often resort to integration into unified software packages several software products, while unifying the formats of processed files. The most common product of this class is the MS Office package, which includes Word, Excel, Power Point, Access, Outlook in a single package.

Software for solving applied mathematical (statistical) problems allow you to make mathematical calculations, analytical calculations.

Translators. Games. Entertainment. A class of programs for translating textual information, game programs, programs for viewing slides, listening to sound files, video files.

PPO for special purposes.

Solves more narrow tasks, as well as tasks of a professional nature in various subject areas.

Information Systems (IP), providing ample opportunities in:

- enterprise management- this is a warehouse, office workflow;

- accounting- these are systems that have the functions of text, table editors and DBMS. Designed to automate the preparation of the initial accounting documents of the enterprise and their accounting;

- analysis of economic and financial activities; they are used in banking and exchange structures. They allow you to control and predict the situation in the financial, trading markets, etc.

Expert systems represent the further development of database management systems. Designed for analyzing data stored in knowledge bases. Unlike DBMS that perform data manipulation operations, expert systems produce logical analysis data, self-learning function.

Computer-aided design systems- designed to automate design processes, used in mechanical engineering, construction, architecture.

Professional software products include accounting systems, automated control systems, automated research systems, etc.

Cuneiform screen Stylus 3.0 working window after calling the source text file: 1 - program title; 2 - menu bar; 3 - palette of "main" icons; 4 - a palette of icons "formatting"; 5 - a palette of icons "translation"; 6 - palette of icons "service"; 7 - window title source file; 8 - working window of the source file; 9 - information panel

Application programs are designed to perform certain functional tasks of computer publishing systems (for example, for processing text, mathematical, structural chemical formulas, sheet music, tables, vector and raster graphics, layout and layout of publications), as well as for performing a number of office tasks.

Entering information from paper into a computer is currently a frequently performed task. Until recently, this problem was solved exclusively by coding from a computer keyboard. Most of the documents to be processed are presented in originals on paper. Therefore, an integral part of any office automation system are computer systems that include programs for scanning and optical recognition of document images. The task of recognition is to convert the input (scanned) image into movable text. In other words graphic image each input character is replaced by the designation of that character as used in the computer (i.e., a code understandable by the computer system).

Intelligent system Optical Character Recognition (OCR) Cuneiform operates in an environment Microsoft Windows 3.1 or later. CuneiForm context sensitive help system supported standard system Windows help. The system has the following technological capabilities:

    supports a wide range of desktop scanners;

    recognizes the scanned page (including multi-column and complex text);

    allows you to scan and write an image as TIFF, and start recognition later (it is convenient to scan a batch of documents);

    has an assortment of possibilities for recording files, allowing you to scan a series of pages, and the program will auto-record, assigning sequential names to images (Page1, Page2, etc.);

    separates text from graphics and converts it from a graphic image to a text file for one of the word processors, databases, or spreadsheets;

    has the function "Determine the area of ​​recognition", which allows selective recognition in selected parts of the page;

    recognizes letters of the Russian and English alphabets, excluding stylized fonts such as Gothic letters;

    recognizes all commonly used fonts (including bold, italic, and underlined) that can be mixed within a page, even within a paragraph or word;

    can process documents printed by typography, on LQ and NLQ dot matrix printers, inkjet, laser printers, on a typewriter (both monospaced and proportional printing are acceptable);

    can preserve original formatting and tabs and adjust indentation and alignment;

    has an internal text editor and dictionary control, with which you can control the quality of recognition and edit text. To do this, the built-in editor window shows the recognized text with the highlighting of doubtful characters and words that are not in the dictionary. In the extension window located next to it, you can see an enlarged image of the scanned text, which allows editing without looking into the original document;

    does not recognize handwritten text.

The Cuneiform screen contains four main parts, as indicated in the figure.

The main actions(procedures) when working in the system are as follows:

Panel Button

Function performed

Scanning and Recognition The "Scan and Recognize" button will be useful if you are sure that the document layout is simple enough and will not complicate the text recognition (for more complex fragmentation, the "Scan and Show" operation is useful). When you click the button in the Quick Access Toolbar (or call the Action \ Scan and OCR command), a message window appears, letting you know that scanning is in progress. Further, the "Progress-indicator" field shows the current phase of character recognition, as well as the total percentage of recognition performed. Pressing the button is equivalent to performing the commands "Scan and Show" and then "Recognize".
Open image Allows you to load an image from an existing graphic file (using the button or the command File \ Open Image). In the window that appears, you must select the names of the directory and file.
Scan and show Allows you to scan a document and view it (using the button or the command Action \ Scan and Show) without starting character recognition. Scanning will start immediately and the scanned image window will appear on the right side of the screen.
Recognition It is used in the case of starting (using the button or the Action \ Recognition command) symbolic recognition after performing the "Scan and View" action or after downloading a file.
Brightness Allows you to set the optimal brightness level for scanning a document (using the button or the command Action \ Brightness). CuneiForm offers 256 levels of brightness (0-255). By default, its value is 127. If there are many recognition errors due to the fact that the scanned image is too light, you need to set a lower brightness and rescan the document. If the image is too dark, you need to set it to a higher brightness. The brightness level can be changed as follows:
a) by moving the slider on the scale to the right and left;
b) setting the brightness value automatically. To do this, press the "Automatic" button in the "Brightness setting" field. A small dialog box will appear, suggesting: "Select left button mouse to select the brightness or click Cancel. "When you move the cursor over the image, it will change to a light bulb with a cross next to it. Next, move the crosshair to the part of the image that has medium density or darkness, and then click the mouse button. The automatic brightness function examines the area around the point selected by the operator in order to determine the overall brightness setting. Next, you will be prompted to rescan the page using the new brightness value. If you select "Yes", the document will be rescanned. This should improve the accuracy when you restart the recognition. You can also call the window " Adjusting Brightness "by selecting" Brightness "from the" Action "menu. Regardless of where it is set, the current brightness level is always displayed at the left end of the status bar at the bottom of the screen.
Previous image The action is performed by pressing the button or the command File \ Restore previous image to return the last image that was on the screen during this session.
Page stitching It is used when there is a need to scan a full sheet with a hand-held scanner (which has a capture window smaller than the page size). By pressing the button or the command Action \ Glue pages, the process of processing the right or top parts of the page is called in accordance with the selected gluing method. First, the TWAIN dialog for working with the scanner is called and the corresponding part of the page is scanned, and then the recognition process is started. The entire chain of actions is automatically repeated for the next part of the page. The next stage is the gluing of these two recognized parts into a single text.

Status bar may include the following main elements installed inside the main menu: (see figure).

The Expansion window is designed to display a part of the scanned image at a higher magnification. This window appears on the screen when there is a need for a more thorough viewing of the fragment. The window is called in the menu View \ Extension. The magnification in the extension window is controlled by selecting the menu View \ One to One, 200% or 400% magnification. The place in which the Extension window appears on the screen depends on the actions as a result of which it appeared:

    if it is called at the location of the cursor after using "Scan and Show" or after opening a file, it will appear in the lower left part of the screen;

    if it appears together with the "Editor" window after recognition, then its position depends on the screen design selected from the "Window" menu;

    the extension window can be moved around the screen by moving it in the mode Drag and Drop.

Manual fragmentation window used when the recognized document has a complex structure consisting of more than one column of text and / or graphics.

In normal mode, after scanning, CuneiForm automatically fragments and decomposes the image into blocks, which are considered as connected, meaningful chunks of text and are surrounded by a dotted line when displayed in the "Fragmentation" mode. However, a text block is just a collection of symbols and does not always constitute a complete semantic fragment.

Because the program bases its division on gaps rather than analyzing contextual information, it cannot perfectly assess which text is related. The Manual Fragmentation window provides several ways to further create blocks when automatic fragmentation is imprecise.

The "Fragmentation" mode exists just to correct mistakes made during automatic fragmentation before recognition. In this case, select the menu Options \ Fragmentation. Moreover, the word "Fragm." appears on the right in the status line. The Manual Fragmentation window will appear later, after scanning a document or calling it from a graphic file.

In addition to the automatically marked blocks, the window includes a column of buttons lined up along its left side:

Button

Function performed

Execute Provides a transition to perform recognition after the necessary fragmentation actions
Return Redo Allows you to go back one step and redo the last action taken
Text Allows to select a block containing, for example, all parts of the table so that they merge into one fragment. To do this, after pressing the button, you need to mark a rectangle around the selected area with the cursor. As a result, a new block is created, which, when recognized, is considered as a single whole.
Horiz. Vert. Allows to indicate additional fragmentation horizontally or vertically. To do this, after selecting the button, place the cursor in the desired place and, in drag mode, mark the fragmentation border, respectively, horizontally or vertically.
Glue Allows you to "glue" fragments created using the "Horiz." and "Vert." To do this, just mark the fragments to be glued with the cursor and select the "Glue" button
Delete Allows you to remove the fragments marked with the Text button from the recognition process
Increase. Zooms in on a piece of text in the Manual Fragmentation window. To do this, after selecting the button, fix the cursor in the desired place in the text and click the mouse button
Didn't take it away. Allows you to return the enlarged piece of text to its original state. To do this, after selecting the button, fix the cursor in the selected place and click the left mouse button
Drawing Lets you create a new picture
Help Allows you to display help information about using the manual fragmentation control buttons

Additional options for correcting erroneous block division in the "Manual fragmentation" window:

    horizontal separation;

    vertical separation;

    gluing.

Suppose there are two columns of text in the document, but CuneiForm "does not see" this. On the contrary, it merges two columns into one block. Since further this can lead to problems in recognition, it is necessary to divide the columns into blocks:

    press "Vert." on the left side of the "Fragmentation" window;

    place the cursor where you want to start splitting;

    while holding down the mouse button, drag the cursor to the point where the separation should end;

    release the button.

CuneiForm will split the block vertically into two text chunks (the Split Horizontally function performs the same action on text that should be split horizontally).

In some cases, CuneiForm can automatically split a continuous piece of text into chunks. To correct this split, it is necessary to click with the cursor on each of the fragments and press the "Glue" button. The two fragments will be combined into one block.

In the Editor window on top there is a simple menu of five buttons: "Exit", "Save As ...", "Attach to ...", "Next. questionable ”and“ Add word ”. With their help, when using the editing window, it is convenient to perform a number of procedures.

The "Editor" window contains the recognition result. After recognition, the editing window covers the image in the image window. The text selected by the user in the edit window is synchronized with the image in the extension window. When you move the cursor in the editing window, the image in the extension window is shifted according to the new cursor position, and the selected character is highlighted

Errors made in the recognition process can be edited in one of the word processors later or using the "Editor" menu and the buttons of the editing window. In the latter case, using the View \ Settings menu, you can select the size of the symbols displayed on the screen for ease of editing.

A number of additional conveniences are provided for editing:

    opportunity to carry out automated check spelling after selecting the Options \ Vocabulary control menu (the so-called doubtful words, i.e. words that are not in the dictionary, are displayed on the screen);

    fast movement during vocabulary control using the "Next. doubts. " to the next word missing in the dictionary;

    the ability to open and use the user's dictionary imported in the File \ Dynamic Dictionary \ Load ... or Import ... menu;

    the ability to add to the user's dictionary a highlighted but correct word during checking by pressing the "Add word" button;

    the ability to export a custom dictionary in the File \ Dynamic Dictionary \ Unload ... or Export ... menu after using it or replenishing it.

Menu, command Action being performed
FILE
Open image read an image from a file of the following formats: TIFF 5.0, PCX, BMP, GIF, TARGA, JPEG
Restore pre. image get a scanned and stored image for selection new area recognition (if needed)
Remember image record image in TIFF 5.0 format
Open ED file open a file with previously recognized text
Remember in text write recognized text in files of a certain format, for example, ASCII, Smart ASCII, RTF, ANSI, Smart ANSI
Glue with text append the recognized text to the end of an existing text file
Select from TWAIN list select a scanner to use under the TWAIN interface
Get TWAIN image use image access via TWAIN
Dynamic Dictionary use user dictionary
Recognition module load / unload recognition module
Output quit Cuneiform
EDITOR
Track. doubtful go to next questionable word
Prev doubtful go to the previous questionable word
Mark questionable words on off. doubtful words highlighting mode
Search search for character strings in recognized text
Search trail repeat search
Add word legalize the word and add it to the list
Cancel adding make the word doubtful and remove it from the list
/

It is also possible to use the control panel at the top of the editing window, which consists of several buttons: Exit, Save As ..., Attach to ..., Next. doubts. and Add word /

CuneiForm command menu

Intuitia 2.0 for Windows uses omniphone technology (recognizes different fonts without any training). It provides recognition of images of text materials from files in TIFF formats, PCX, BMP, and all TWAIN scanners and the HP ScanJet family scanners (direct).

Illustrations are automatically excluded from the recognition process and can be saved in files or placed in other applications.

The program uses built-in spelling check and correction of recognized text. It distinguishes and retains the characteristic features of the font design of the text: the size of the fonts, their style.

Results can be saved in TXT, RTF files, and also directly redirected to other Microsof Windows applications (for example, Microsoft Notepad, Microsoft Write, Microsoft Word word processors or Microsoft Excel spreadsheet).

PenO "Man for Windows" handwriting input and recognition system- a tool for handwriting input, recognition and text editing with a stylus:

    it is possible to enter and edit both English and Russian as a written text in all Windows applications;

    the input process is similar to the usual use of a pen when writing from left to right, preferably in a neat handwriting and with a classic left tilt;

    editing already entered text is possible as a result of using standard functions (insert, delete, transfer, activate text fragments, etc.), as well as simple strokes of the pen.

Main technological characteristics of the system:

Panels required for operation (if they are not on the screen by default) are displayed in the VIEW \ Toolbars menu.

After the translation is completed, the window is automatically split into two parts for simultaneous rendering on the screen of the source file and the translation. Programs usually provide two modes of dividing the working window of a text file into parts: horizontally and vertically.

The programs usually considered in this section allow not only translating into another language, but, if necessary, editing the source text and translation, namely:

    enter new text;

    make corrections to the text;

    work with blocks of text (select individual fragments, delete them and change the location).

Setting basic parameters provides a significant list of settings that can be performed, for example, in the SERVICE \ Parameters menu:

Dashboard includes three tabs:

    used dictionaries - to display a list of used dictionaries for the selected translation direction;

    unfamiliar word - to display a list of unfamiliar words when processing the source file;

    reserved words - to display a list of words that do not need translation.

Preparing a document for translation usually involves performing the following actions to improve the reliability of the translation:

    spell check (for example, for Stylus 3.0 in the SERVICE \ Options \ Spelling menu);

    checking the correctness of the paragraph breakdown. For example, the Stylus 3.0 program provides the ability to visually check the correctness of the text breakdown into paragraphs (in fact, it is a check for the absence of unnecessary, empty paragraphs in the text). To do this, in the SERVICE \ Options \ Colors menu, mark the box "Use conditional icons").

In this case, a special icon in the form of an unshaded triangle, set by the program to the left of each paragraph, fixes the paragraph breakdown;

Menu, command Action being performed
ACTION
Batch Scan scan a series of pages. In the window that appears, there are five main fields: Options, Page Memorization, Create a document with the number of pages, Change, Output format. It is possible to combine their values ​​in different ways, depending on whether the scanner has an automatic feeder, whether it is necessary to start recognition before recording, control the recognition area, etc.
Batch recognition enable recognition of a series of pages. The functions of the Options, Modify and Results fields are identical to the corresponding functions in batch scanning.
Scanning and Recognition use scanning and automatic document recognition - usually if the document layout is simple enough and does not complicate text recognition (for more complex fragmentation, the Scan and display operation is very useful)
Scan and display scan a document and view it without running the first character recognition. The scanned image appears on the right side of the screen.
Language selection choose one of the possible languages: Russian (Russian-English mode is possible), English, German and French
Page stitching enable the mode of using a handheld scanner with a capture window smaller than the page size. This invokes the process of processing the right or the top of the page in accordance with the choice of the gluing method.
Recognize start character recognition after executing the Scan and Display mode or after loading an image file
Recognition area start the recognition mode for any part of the image. The area around the required part of the image is indicated with the mouse
Turn rotate the image that appeared on the screen after scanning or downloading a file 90 ° to the left (right) or 180 °
Scanner selection select scanner type
Permission choose a resolution of 200, 300 or 400 dots per inch (DPI)
Brightness set the optimum brightness level for scanning a document. CuneiForm offers 256 levels of brightness (0–255). By default, its value is 127
OPTIONS
Fragmentation select fragments in the case when the document to be scanned has a clearly complex design and consists of more than one column with text and / or graphics
One column configure CuneiForm to treat all characters in the scanned image as one column of text
Vocabulary control enable the mode in which CuneiForm will highlight doubtful words that need to be dealt with after document recognition
Adaptive Scanning use all CuneiForm tools to get the best possible image
Matrix printer select a mode so that CuneiForm recognizes documents printed on a dot matrix printer
Page orientation Choose from four possible orientations: Normal Page, Page Left, Page Flipped, and Page Right. If the selected orientation is different from normal, CuneiForm rotates all open or scanned images. The current orientation is displayed by an icon at the left end of the status line
Bonding method select one of the ways to glue the page: Vertical or Horizontal before using the Merge Pages command. The first will process the left half of the page first, and then the right. In the second case, the top and then the bottom of the page
Illustrations set the mode of saving the illustration together with the text, as it was in the original document, using the RTF format and specifying one of the types of available illustrations: black and white, grayscale, color
VIEW
Page view

To fit the window
· one to one
200%
400%

enable / disable image window

set the "Fit to window" mode for the image window
set the mode without magnification for the image window
set the magnification to 2 times for the image window
set the magnification to 4 times for the image window

Extension

· one to one
200%
400%

enable / disable extension window

set the mode without magnification for the extension window
set the enlargement to 2 times for the expansion window
set the enlargement to 4 times for the expansion window

Editor settings set letter size and representation of poorly recognized characters in the edit window
WINDOW
Horizontal division split the screen horizontally so that the Edit Window occupies the top of the screen and the Expansion Window occupies the bottom
Vertical division