
ArabicParser Class Reference

#include <ArabicParser.hpp>

Inheritance diagram for ArabicParser: Parser, TextHandler.

Public Methods

 ArabicParser ()
void parseFile (const string &filename)
 Parse a file.

void parseBuffer (char *buf, int len)
 Parse a buffer of length len.

long fileTell () const

Static Public Attributes

const string identifier = "arabic"

Detailed Description

Parses Arabic documents in NIST's TREC format, encoded in Windows CP1256.

The following fields are parsed: TEXT, HL, HEAD, HEADLINE, LP, TTL, HEADER, FOOTER.
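As an illustration of what restricting parsing to these fields means, here is a minimal sketch that pulls the listed fields out of a TREC-style document. It is not the Lemur implementation: the helper name extractParsedFields and the constant kParsedFields are hypothetical, and the sketch assumes plain `<TAG>...</TAG>` markup without attributes or nesting of the same tag.

```cpp
#include <set>
#include <string>
#include <utility>
#include <vector>

// Field tags listed in the class description above.
static const std::set<std::string> kParsedFields = {
    "TEXT", "HL", "HEAD", "HEADLINE", "LP", "TTL", "HEADER", "FOOTER"};

// Hypothetical helper (not part of Lemur): returns (tag, content) pairs for
// every <TAG>...</TAG> span whose tag is in kParsedFields. Other tags, such
// as <DOC> or <DOCNO>, are skipped.
std::vector<std::pair<std::string, std::string>>
extractParsedFields(const std::string &doc) {
  std::vector<std::pair<std::string, std::string>> out;
  std::string::size_type pos = 0;
  while ((pos = doc.find('<', pos)) != std::string::npos) {
    std::string::size_type close = doc.find('>', pos);
    if (close == std::string::npos) break;  // malformed tag: stop scanning
    std::string tag = doc.substr(pos + 1, close - pos - 1);
    if (kParsedFields.count(tag)) {
      std::string endTag = "</" + tag + ">";
      std::string::size_type end = doc.find(endTag, close + 1);
      if (end == std::string::npos) break;  // unterminated field: stop
      out.emplace_back(tag, doc.substr(close + 1, end - close - 1));
      pos = end + endTag.size();  // continue after the closing tag
    } else {
      pos = close + 1;  // not a parsed field: move past this tag
    }
  }
  return out;
}
```

The content is returned as raw bytes; converting CP1256 to another encoding is outside the scope of this sketch.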


Constructor & Destructor Documentation

ArabicParser::ArabicParser  
 


Member Function Documentation

long ArabicParser::fileTell () const [virtual]
 

Returns the current byte offset into the file being parsed. Do not use with parseBuffer.

Implements Parser.
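A byte offset of this kind is typically useful for remembering where a document starts so that it can be located again later. The sketch below shows that idea with the standard library only; it does not call fileTell, and the helper docStartOffsets is a hypothetical name. It assumes each document begins on a line that starts with `<DOC>`.

```cpp
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper (not Lemur code): returns the byte offset at which
// each line beginning with "<DOC>" starts in the stream.
std::vector<long> docStartOffsets(std::istream &in) {
  std::vector<long> offsets;
  std::string line;
  for (;;) {
    // Record the position before reading, so it points at the line start.
    long here = static_cast<long>(in.tellg());
    if (!std::getline(in, line)) break;
    if (line.compare(0, 5, "<DOC>") == 0) offsets.push_back(here);
  }
  return offsets;
}
```

A caller could later clear the stream state, seek to one of the returned offsets, and reread the document from there. Such offsets only make sense for a file or stream source, which is consistent with the warning against combining fileTell with parseBuffer.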

void ArabicParser::parseBuffer (char *buf, int len) [virtual]
 

Parse a buffer of length len.

Implements Parser.
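Because the signature takes a non-const char * and an int length, text held in a std::string has to be copied into a mutable buffer before it can be passed in. The sketch below shows one way a caller might do that. CountingSink is a hypothetical stand-in that only mirrors the documented parameter list, and feedString is likewise an assumed helper name; neither is part of Lemur.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in with the same parameter list as the documented
// parseBuffer. It counts bytes instead of parsing them.
struct CountingSink {
  long bytesSeen = 0;
  void parseBuffer(char *buf, int len) {
    (void)buf;  // contents are ignored in this stand-in
    bytesSeen += len;
  }
};

// Copies text into a mutable buffer and hands it to any object that offers
// parseBuffer(char *, int).
template <typename ParserLike>
void feedString(ParserLike &parser, const std::string &text) {
  std::vector<char> buf(text.begin(), text.end());  // mutable copy
  parser.parseBuffer(buf.data(), static_cast<int>(buf.size()));
}
```

Since len is an int, inputs larger than INT_MAX bytes would need to be split before being passed in; the sketch does not handle that case.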

void ArabicParser::parseFile (const string &filename) [virtual]
 

Parse a file.

Implements Parser.


Member Data Documentation

const string ArabicParser::identifier = "arabic" [static]
 

Reimplemented from Parser.


The documentation for this class was generated from the following files:
Generated on Wed Nov 3 12:59:23 2004 for Lemur Toolkit by doxygen 1.2.18