A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://mail.python.org/pipermail/python-dev/2002-September/028507.html below:

[Python-Dev] The first trustworthy <wink> GBayes results

[Python-Dev] The first trustworthy <wink> GBayes resultsDelaney, Timothy tdelaney@avaya.com
Mon, 2 Sep 2002 10:38:10 +1000
> From: Delaney, Timothy [mailto:tdelaney@avaya.com]
>
> Whether any weighting should be applied to single words or 
> word pairs I
> don't know - my gut feeling is that they should be weighted 
> the same, but
> guts are no replacement for empirical evidence.

On second thought - if a word-pair appears, then the separate parts should
not be checked as separate words.

So, If I had scores:

    'free'              0.1
    'beer'              0.1
    ('want', 'free',)   0.9
    ('free', 'beer',)   0.01
    ('free', '!!!',)    0.99

then the following phrases would match (case-folding) as:

    'I want free beer!!!':

    ('want', 'free',)   0.9
    ('free', 'beer',)   0.01

    'Get *** for free!!!'

    ('free', '!!!',)    0.99

    'I want free beer. Free the beer!!!'

    ('want', 'free',)   0.9
    ('free', 'beer',)   0.01
    'free'              0.1
    'beer'              0.1

Damn I wish I was at home to try this out ... :(

Tim Delaney



RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4