Documentation TYPO3 par Ameos |
Public Member Functions | |
| tx_indexedsearch_lexer () | |
| split2Words ($wordString) | |
| addWords (&$words, &$wordString, $start, $len) | |
| get_word (&$str, $pos=0) | |
| utf8_is_letter (&$str, &$len, $pos=0) | |
| charType ($cp) | |
| utf8_ord (&$str, &$len, $pos=0, $hex=false) | |
Public Attributes | |
| $debug = FALSE | |
| $debugString = '' | |
| $csObj | |
| $lexerConf | |
Definition at line 73 of file class.lexer.php.
|
|
Constructor: Initializes the charset class, t3lib_cs
Definition at line 105 of file class.lexer.php. References t3lib_div::makeInstance(). |
|
||||||||||||||||||||
|
Add word to word- array This function should be used to make sure CJK sequences are split up in the right way
Definition at line 178 of file class.lexer.php. References charType(), and utf8_ord(). Referenced by split2Words(). |
|
|
Determine the type of character
Definition at line 329 of file class.lexer.php. Referenced by addWords(). |
|
||||||||||||
|
Get the first word in a given utf-8 string (initial non-letters will be skipped)
Definition at line 239 of file class.lexer.php. Referenced by split2Words(). |
|
|
Splitting string into words. Used for indexing, can also be used to find words in query.
Definition at line 116 of file class.lexer.php. References addWords(), and get_word(). |
|
||||||||||||||||
|
See if a character is a letter (or a string of letters or non-letters).
Definition at line 264 of file class.lexer.php. References t3lib_div::inList(). |
|
||||||||||||||||||||
|
Converts a UTF-8 multibyte character to a UNICODE codepoint
Definition at line 383 of file class.lexer.php. Referenced by addWords(). |
|
|
Initial value: array(
'printjoins' => array( // This is the Unicode numbers of chars that are allowed INSIDE a sequence of letter chars (alphanum + CJK)
0x2e, // "."
0x2d, // "-"
0x5f, // "_"
0x3a, // ":"
0x2f, // "/"
0x27, // "'"
// 0x615, // ARABIC SMALL HIGH TAH
),
'casesensitive' => FALSE, // Set, if case sensitive indexing is wanted.
'removeChars' => array( // List of unicode numbers of chars that will be removed before words are returned (eg. "-")
0x2d // "-"
)
)
Definition at line 83 of file class.lexer.php. |
1.4.6