Thursday, February 03, 2005

review questions

Today i had my project review . Some of the questions
that was asked by the guide are:

1) What are the attributes do u filter the mail.
List out the what are all the things u
check in order to consider to derive the conclusion of
spam and legitimate message.

2) If mail is the literal translation of the tamil
words wriiten in english then what do the filter do.
Whether it will consider it as spam or it will pass
through the filter.

Ans : ( I guess)

If the mail is written in english though the words
represent tamil words( ie literal translation of the
tamil sentences in english)l then each token will be
given the probability of 0.4 (as the tokens are all
new) according to bayesian filter concept. Thus the
mail will be considered as non spam mail.

3) As there is tamil sentence written in english.
There are chances that users may use different
spelling to represent the same word. Then what is the
case?

Ans : ( I guess)

There are possibility of only a few words to be
misspelled or spelled differently. At this case, while
calculating the combined probabilty there are chances
of leaving those words and thus due to calculation of
combined probability the mail will pass through the
filter.

4) bayesian filter is better than what other filters.
Say with reason.


--
O.R.Vaishnavi Devi
Thiagarajar College

________________________________________________________________________
Yahoo! India Matrimony: Find your life partner online
Go to: http://yahoo.shaadi.com/india-matrimony

1 Comments:

Blogger Senthil Kumaran said...

Vaish, very good!!! The answers make lot of sense.

February 3, 2005 at 12:32 PM  

Post a Comment

<< Home