TextExtraction.GetText

Syntax:

GetText()

Returned value:

Type: String.
The text taken from a file, or empty string in case of an error.

Description:

Extracts text from file. File type is determined based on file extension (i.e. no text will be extracted from a file with .tmp extension). Use IFilter interface to extract text from files. It is extendable. You can install additional modules in order to extract text from one more type of files. By default the text is extracted from the files of the following types (with extensions): ASCX, ASP, ASPX, CSS, HHC, HTA, HTM, HTML, HHT, HTW, HTX, ODC, STM, DOC, DOT, POT, PPS, PPT, XLB, XLC, XLS, XLT, TXT, EML. It is presumed that files with TXT extensions are text files with ANSI or UNICODE (UTF16) encoding.

Availability:

Thin client, server, thick client, external connection.

Note:

To be used in Windows NT 4.0, 2000, XP, Vista, 7 environments only.

Example:

FileName = "C:/log.txt";
Object = New TextExtraction(FileName);
Text = Object.GetText();
Message(Text);


    

1C:Enterprise Developer's Community