Jack Sleight
2008-06-27 10:39:30 UTC
Hi,
I'm a little confused. I have a database from which I'm building a
search index. The database is in utf-8, and when I add fields to the
index I'm specifying utf-8 as the encoding. It builds the index fine
without any errors or warning. The index functions perfectly, except if
you search for a word containing letters with diacritics, eg.
"Führerschein", which returns no results. I've tried setting the
analyzer to Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8, and I've
also tried setting the encoding in
Zend_Search_Lucene_Search_QueryParser::parse() to utf-8, but neither work.
Any ideas?
I'm a little confused. I have a database from which I'm building a
search index. The database is in utf-8, and when I add fields to the
index I'm specifying utf-8 as the encoding. It builds the index fine
without any errors or warning. The index functions perfectly, except if
you search for a word containing letters with diacritics, eg.
"Führerschein", which returns no results. I've tried setting the
analyzer to Zend_Search_Lucene_Analysis_Analyzer_Common_Utf8, and I've
also tried setting the encoding in
Zend_Search_Lucene_Search_QueryParser::parse() to utf-8, but neither work.
Any ideas?
--
Jack
Jack