Sunday, March 25, 2012

Changing the Language Noise Words

Hi,
I have searched high and low for an answer to this question. Please
tell me that it is a simple process...!
I have created a website in English using SQL 2000 as the database.
Now I have just launched the same website in French using exactly the
same schema (and same server) as the english website. Both use
full-text search. BUT...I notice that the noise word file being used
for both is English so, naturally, I need to change my French language
site to French. How can I do this?
I have already tried changing the database and field collation in the
French database and re-creating and re-populating the full-text index
but the English noise word file is still being used. Please, somebody
help!!
Darren,
You can switch to another language via dropping the existing FT Catalog and
re-creating it via the FT Indexing Wizard and when you pick the column (Text)
to be FT Indexed, click on the drop down box marked "Language for Word
Breaker" and select your language and then run a full population.
(Thanks to John for telling that trick!-)
Regards, Gerald.
"Darren Jensen" wrote:

> Hi,
> I have searched high and low for an answer to this question. Please
> tell me that it is a simple process...!
> I have created a website in English using SQL 2000 as the database.
> Now I have just launched the same website in French using exactly the
> same schema (and same server) as the english website. Both use
> full-text search. BUT...I notice that the noise word file being used
> for both is English so, naturally, I need to change my French language
> site to French. How can I do this?
> I have already tried changing the database and field collation in the
> French database and re-creating and re-populating the full-text index
> but the English noise word file is still being used. Please, somebody
> help!!
>
|||Darren,
Garald is correct. Just some additional info for future reference... You can
find out which noise.* is linked to what language via the Schema.txt file
under \FTDATA\SQLServer\Config, for example for noise.FRA is the noise word
file for French:
<stoplist
language="French_French"
file="noise.fra"
primarylanguage=12
sublanguage=1>
Regards,
John
"Gerald Baeck" <GeraldBaeck@.discussions.microsoft.com> wrote in message
news:209E46D0-5BEE-4531-8145-A9B8DEDFCF50@.microsoft.com...
> Darren,
> You can switch to another language via dropping the existing FT Catalog
and
> re-creating it via the FT Indexing Wizard and when you pick the column
(Text)[vbcol=seagreen]
> to be FT Indexed, click on the drop down box marked "Language for Word
> Breaker" and select your language and then run a full population.
> (Thanks to John for telling that trick!-)
> Regards, Gerald.
> "Darren Jensen" wrote:
|||Thanks, that worked a treat and very easy to do too!
Gerald Baeck <GeraldBaeck@.discussions.microsoft.com> wrote in message news:<209E46D0-5BEE-4531-8145-A9B8DEDFCF50@.microsoft.com>...
> Darren,
> You can switch to another language via dropping the existing FT Catalog and
> re-creating it via the FT Indexing Wizard and when you pick the column (Text)
> to be FT Indexed, click on the drop down box marked "Language for Word
> Breaker" and select your language and then run a full population.
> (Thanks to John for telling that trick!-)
> Regards, Gerald.
|||I have one follow up question on the subject of fulltext indexing with
French and that is how can I get the search to ignore accents? For
example if I search 'hopital' I would like the results to show all
places where it finds 'hpital' or 'hopital'. Is this possible?
Thanks.
jensendarren@.hotmail.com (Darren Jensen) wrote in message news:<c2c1a066.0411171919.75ce4862@.posting.google. com>...[vbcol=seagreen]
> Thanks, that worked a treat and very easy to do too!
> Gerald Baeck <GeraldBaeck@.discussions.microsoft.com> wrote in message news:<209E46D0-5BEE-4531-8145-A9B8DEDFCF50@.microsoft.com>...
|||Darren,
Unfortunately, the issue of accent sensitivity vs. accent insensitivity FTS
is more difficult to answer. This is a well known bug that has been around
for a long time (since SQL Server 7.0) and is truly only going to be fixed
in SQL Server 2005 <sigh>. As there are only workarounds for this bug in SQL
Server 2000 that require the use of duplicate data where you store the
non-accented search words, such as 'hopital'. You would then FT Index and FT
Search this column, while returning the accented search data (such as
'hpital') from another column back to the user. You will need to develop
accent removal procedures as well as use triggers (insert & update) to
maintain the currency of the two columns...
Regards,
John
"Darren Jensen" <jensendarren@.hotmail.com> wrote in message
news:c2c1a066.0411222026.3278d8fe@.posting.google.c om...
> I have one follow up question on the subject of fulltext indexing with
> French and that is how can I get the search to ignore accents? For
> example if I search 'hopital' I would like the results to show all
> places where it finds 'hpital' or 'hopital'. Is this possible?
> Thanks.
> jensendarren@.hotmail.com (Darren Jensen) wrote in message
news:<c2c1a066.0411171919.75ce4862@.posting.google. com>...[vbcol=seagreen]
news:<209E46D0-5BEE-4531-8145-A9B8DEDFCF50@.microsoft.com>...[vbcol=seagreen]
Catalog and[vbcol=seagreen]
(Text)[vbcol=seagreen]
|||Darren,
my workaround for this problem is to replace all special chars before
filling the index. I know it could take a while, but till Yukon is released
its the only way i think.
UPDATE [table] SET [field] = REPLACE([field], '', 'o')
Regards, Gerald.
"Darren Jensen" <jensendarren@.hotmail.com> schrieb im Newsbeitrag
news:c2c1a066.0411222026.3278d8fe@.posting.google.c om...[vbcol=seagreen]
>I have one follow up question on the subject of fulltext indexing with
> French and that is how can I get the search to ignore accents? For
> example if I search 'hopital' I would like the results to show all
> places where it finds 'hpital' or 'hopital'. Is this possible?
> Thanks.
> jensendarren@.hotmail.com (Darren Jensen) wrote in message
> news:<c2c1a066.0411171919.75ce4862@.posting.google. com>...

No comments:

Post a Comment