0
Verity & non-English categories

/t5/coldfusion-discussions/verity-amp-non-english-categories/td-p/601105
Mar 18, 2007
Mar 18, 2007
Copy link to clipboard
Copied
Hi there
I've attached some test code, which demonstrates what I'm doing. It looks a bit long-winded - it's 60-odd lines - and I'd usually not post that much, but I kind of have to here. It's pretty simple, so I hope you don't get frightened away by the apparent length of my post.
Note: to run the code, you'll need to D/L and install the Eastern European lanugage pack from here, first: http://www.macromedia.com/go/verity. Note the install instructions are wrong, and you should put the files in {verity}\k2\common, not in the {cf} install dir. For me the correct dir was C:\ColdFusionSearchService\k2\common.
I am trying to index some Russian-language data, which also has Russian-language categorisation. The basic indexing works fine: I can index and search for Russian text. However if I add Russian-language categories to my indexed records, I cannot search on those categories: I always get no results.
Oddly, if I change the language of the collection to English and search on the Russian-language categories, it "works", but all the Cyrillic data is munged. So "close but no banana" on that one.
I have to concede that today is the first time I've tried to use Verity in any language other than English, so I figure (hope!) I am just doing something daft.
Any thoughts?
--
Adam
I've attached some test code, which demonstrates what I'm doing. It looks a bit long-winded - it's 60-odd lines - and I'd usually not post that much, but I kind of have to here. It's pretty simple, so I hope you don't get frightened away by the apparent length of my post.
Note: to run the code, you'll need to D/L and install the Eastern European lanugage pack from here, first: http://www.macromedia.com/go/verity. Note the install instructions are wrong, and you should put the files in {verity}\k2\common, not in the {cf} install dir. For me the correct dir was C:\ColdFusionSearchService\k2\common.
I am trying to index some Russian-language data, which also has Russian-language categorisation. The basic indexing works fine: I can index and search for Russian text. However if I add Russian-language categories to my indexed records, I cannot search on those categories: I always get no results.
Oddly, if I change the language of the collection to English and search on the Russian-language categories, it "works", but all the Cyrillic data is munged. So "close but no banana" on that one.
I have to concede that today is the first time I've tried to use Verity in any language other than English, so I figure (hope!) I am just doing something daft.
Any thoughts?
--
Adam
TOPICS
Advanced techniques
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting.
Learn more
LEGEND
,
/t5/coldfusion-discussions/verity-amp-non-english-categories/m-p/601106#M55893
Mar 18, 2007
Mar 18, 2007
Copy link to clipboard
Copied
Ballocks.
For those using the newsfeed, this line:
<cfset sCategory = "ÀÞááØï"><!--- change this from "ÀÞááØï" to be something
in English, and it works fine --->
Might come out with question marks instead of the word "Russia" in Russian.
It reads OK on the web-UI.
--
Adam
For those using the newsfeed, this line:
<cfset sCategory = "ÀÞááØï"><!--- change this from "ÀÞááØï" to be something
in English, and it works fine --->
Might come out with question marks instead of the word "Russia" in Russian.
It reads OK on the web-UI.
--
Adam
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting.
Learn more
Community Beginner
,
LATEST
/t5/coldfusion-discussions/verity-amp-non-english-categories/m-p/601107#M55894
Mar 11, 2008
Mar 11, 2008
Copy link to clipboard
Copied
Would it possible to use some numeric values for the
categories and then 'translate' them? (e.g. store the russian
values in a unicode database mapped through the numeric
value)
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting.
Learn more

