I have some documents containing sequences, such as radio/testedthat I would like to return hits in queries like
select * from doc
where to_tsvector('english',body) @@ to_tsvector('english','radio')
Unfortunately, the parser by default accepts radio/testedas a token file(despite the fact that it is in the Windows environment), so it does not match the request above. When I run on it ts_debug, when I see that it is recognized as a file, and the token ends radio/tested, not two tokens radioand test.
Is there a way to configure the parser not to search for tokens file? I tried
ALTER TEXT SEARCH CONFIGURATION public.english
DROP MAPPING FOR file;
... but he did not change the conclusion ts_debug. If there is a way to disable, fileor at least recognize both file, and all the words that, in his opinion, make up the directory names along this path, or if there is a way to make it treat slashes as hyphens or spaces (without hitting regexp_replacetheir own performance ), which would be really useful.
source
share