mnoGoSearch 3.2.31 reference manual
Full-featured search engine software
Copyright © 2001-2005 by Lavtech.com corp.
Table of Contents
1. Introduction
mnoGoSearch Features
Where to get mnoGoSearch.
Disclaimer
Authors
Contributors (in no particular order)
Frequently Asked Questions
2. Installation
SQL database requirements
Supported operating systems
Tools required for installation
Installing mnoGoSearch
Possible installation problems
Creating binary distribution
Installation registration
3. Indexing
Indexing in general
Configuration
Running indexer
SQL back-end notes
How to create SQL table structure
How to drop SQL table structure
Subsection control
How to clear database
Database Statistics
Link validation
Parallel indexing
Supported HTTP response codes
Content-Encoding support
indexer configuration
Specifying WEB space to be indexed
Aliases
ServerTable
FlushServerTable
External parsers
Extended indexing features
News extensions
Indexing SQL database tables (htdb: virtual URL scheme)
Indexing binaries output (exec: and cgi: virtual URL schemes)
Mirroring
Using syslog
Disabling Apache logging
Storing cached copies
Configuring cached copies
Using cached copies at search time
4. mnoGoSearch HTML parser
Tag parser
Special characters
META tags
Links
Comments
5. Storing mnoGoSearch data
SQL storage types
Various modes of words storage
Storage mode - single
Storage mode - multi
Storage mode - blob
Substring search notes
Cache mode storage
mnoGoSearch performance issues
MySQL performance
Post-indexing optimization
SearchD support
Why using searchd
Starting searchd
Oracle notes
Introduction
Compilation, Installation and Configuration
IBM DB2 notes
6. Subsections
Categories
Tags
Tags in SQL version
7. Languages support
Character sets
Supported character sets
Several languages in one database
UTF-8 mode
non-UTF-8 mode
Recoding
Recoding at search time
Character sets aliases
Document charset detection
Automatic charset guesser
Default charset
Default Language
Making multi-language search pages
How does it work?
Possible troubles
Segmenters for Chinese, Thai and Japanese languages
Japanese language phrase segmenter
Chinese language phrase segmenter
Thai language phrase segmenter
Multilingual servers support
8. Searching documents
Using search front-ends
Performing search
Search parameters
Changing different document parts weights at search time
Using front-end with an shtml page
Using several templates
Advanced boolean search
How search handles expired documents
How to write search result templates
Template sections
Template operators
Includes in templates
Security issues
Designing search.html
How the results page is created
Your HTML
Forms considerations
Relative links in search.htm
Adding Search form to other pages
Relevancy
Ordering documents
Boolean search
Crosswords
Search queries tracking
Search results cache
Fuzzy search
Ispell
Synonyms
9. Miscellaneous
Reporting bugs
Currently known bugs
Core dump reports
Using libmnogosearch library
udm-config script
mnoGoSearch API
Database schema
I. Reference
I. mnoGoSearch commands reference
AddType -- associates file names or extensions with mime types
Affix -- includes ispell affix file
Alias -- associates master and mirror sites
AliasProg -- calls external URL parser
Allow -- allows to index defined URLs
AlnumFactor -- this command is obsolete
AuthBasic -- defines basic HTTP authorization user name and password
BaseFiles -- this command is obsolete
BrowserCharset -- defines browser charset
Cache -- enables or disables cache search results
Category -- defines documents category
CheckMP3 -- checks for MP3 meta information
CheckMP3Only -- check for MP3 meta information
CheckOnly -- checks for file existence only
CrossWords -- this command is obsolete
CVSIgnore -- enables or disables indexing internal CVS files
DateFormat -- defines date format
DBAddr -- sets database address
DefaultLang -- defines default language
DetectClones -- enables or disables clone detection
Disallow -- disallows indexing defined URLs
DocMemCacheSize -- this command is obsolete
DocTimeOut -- defines maximal time for document downloading
DoStore -- this command is obsolete
ExcerptSize -- defines maximal length of excerpt
ExcerptPadding -- defines excerpt padding length
FlushServerTable -- flushes server.active to inactive
FollowSymLinks -- dereference or not symlinks
ForceIISCharset1251 -- assume windows-1251 charset
GroupBySite -- this command is obsolete
GuesserUseMeta -- enables or disables using meta tags
HlBeg -- configures search results highlighting
HlEnd -- configures search results highlighting
HoldBadHrefs -- defines timeout for holding bad URLs
HrefOnly -- scan HTML pages only for URLs
HTDBAddr -- HTDBAddr
HTDBDoc -- HTDBDoc
HTDBLimit -- HTDBLimit
HTDBList -- HTDBList
HTTPHeader -- adds desired headers in indexer HTTP request
ImportEnv -- imports given environment variable
Include -- includes additional configuration file
Index -- prevents indexer from storing words into database
IndexIf -- allows documents, which sections match given argument, to be indexed
IspellCorrectFactor -- this command is obsolete
IspellInCorrectFactor -- this command is obsolete
IspellUsePrefixes -- allows to use ispell prefixes while searching
LangMapFile -- loads language map for charset and language guesser
LangMapUpdate -- no description available yet
Limit -- this command is obsolete
Listen -- binds searchd to specified IP address
LoadChineseList -- loads Chinese word frequency list
LoadThaiList -- loads Thai word frequency list
LocalCharset -- defines local charset
LogsOnly -- this command is obsolete
MaxDocSize -- defines maximal document size
MaxHops -- defines maximal way in "mouse clicks"
MaxNetErrors -- defines maximal network errors
MaxWordLength -- defines maximal word length
Mime -- defines external parser for given mime-type
MinWordLength -- defines minimal word length
MirrorHeadersRoot -- defines root directory of mirrored document's headers
MirrorPeriod -- defines period for mirrored files
MirrorRoot -- defines root directory to enable sites mirroring
NetErrorDelayTime -- defines document processing delay
NewsExtensions -- enables news extensions
NoIndexIf -- disallows documents, which sections match given argument, to be indexed
NumberFactor -- this command is obsolete
NumSections -- this command is obsolete
OptimizeInterval -- this command is obsolete
OptimizeRatio -- this command is obsolete
ParserTimeOut -- defines amount of time for parser execution
Period -- defines reindex period
PopRankFeedBack -- calculates sites weights
PopRankShowCntRatio -- PopRankShowCntRatio
PopRankShowCntWeight -- PopRankShowCntWeight
PopRankSkipSameSite -- skips links from same site
PopRankUseShowCnt -- PopRankUseShowCnt
PopRankUseTracking -- PopRankUseTracking
Proxy -- defines HTTP proxy address
ProxyAuthBasic -- defines HTTP proxy user name and password
R0 - R9 -- sets random number
ReadTimeOut -- defines stalled connections timeout
Realm -- works almost like Server command
RemoteCharset -- defines default character set for next Server command(s)
ResultsLimit -- ResultsLimit
ReverseAlias -- ReverseAlias
Robots -- allows using robots.txt
Section -- defines document's section
Server -- describes web-space you want to index
ServerTable -- loads servers from database
ServerWeight -- defines server's weight
Spell -- loads ispell file
StopwordFile -- loads stopwords file
StoredFiles -- this command is obsolete
Subnet -- Subnet
Suggest -- Display misspelled search word suggestions
Synonym -- loads synonyms file
SyslogFacility -- sets syslog facility
Tag -- generic grouping tag
URL -- inserts URL into database
URLDAddr -- this command is obsolete
URLDataFiles -- this command is obsolete
URLSelectCacheSize -- sets URLs cache size for indexer
UseCRC32URLId -- enables generation CRC32 URL IDs
UseRemoteContentType -- specifies if the indexer should get content type from server
VarDir -- defines mnogosearch var directory
VaryLang -- defines languages for multilingual indexing
wf -- wf
WordCacheSize -- defines maximal in-memory words cache size
WrdFiles -- this command is obsolete
Index
List of Tables
3-1. Verbose levels
7-1. Language groups
7-2. Charsets aliases
8-1. Available search parameters
9-1. server table schema
9-2. Server's several parameters values in srvinfo table