[PDF] Lexical Query Paraphrasing for Document Retrieval





Previous PDF Next PDF



LA LIMITE DES THEORIES DE COURBES GENERIQUES

Pour paraphraser [SHM 1984] : les questions résolues dans cet article ont été posées par le premier et le troisi`eme auteurs; pour ce qui est de la.



Effective Paraphrasing

University of Chicago Press 2018)



QuillBot as an online tool: Students alternative in paraphrasing and

1 nov. 2021 QuillBot is an online application to paraphrase writing avoid plagiarism



Paraphrasing and Quoting Responsibly

28 mars 2017 example a complex description of how internet use has evolved over many years. Paraphrase when a source's passage is complex or written in ...



Summarising and paraphrasing

6 oct. 2020 In assignments paraphrasing is a good way to avoid quoting too much ... the main content words are still there (social networking online



Using Internet based paraphrasing tools: Original work patchwriting

the broad range and availability of online paraphrasing tools which offer free. 'services' to paraphrase large sections of text ranging from sentences 



Reddit Temporal N-gram Corpus and its Applications on Paraphrase

11 déc. 2016 As the use of social media is increasing Online Social Networks ... Paraphrase Identification (PI) and Semantic Similarity (SS) tasks of ...



Utilizing Online Paraphrasing Tools to Overcome Students

16 sept. 2021 Paraphrase; Difficulties in paraphrasing; Online Tools;. Literature Reviews. *Correspondence Address: drivoka@unj.ac.id. Abstract. One of the ...



Lexical Query Paraphrasing for Document Retrieval

for queries posed to the Internet. These are para- phrases where content words are replaced with syn- onyms. We use WordNet (Miller et al. 1990) and.





Paraphrasing Tool – Rephrase using online paraphrase tool

Paraphrasing tool is a word changer tool that used to change synonyms Rephrase words and generate unique sentences using online paraphrase tool







Paraphrase - Reformuler un texte - Editpad

Reformuler un texte par Editpad est un reformulateur de texte en ligne qui peut réécrire et paraphraser des phrases complètes des paragraphes et des essais 



Reformuler un texte - Meilleur outil de paraphrase - Paraphrasing Tool

Reformuler un texte est le meilleur outil de paraphrase pour la reformulation de texte et la réécriture d'essais Notre changeur de mots peut changer la 



Paraphrase Online - Best Free Paraphrasing Tool

Paraphrase Online is the best paraphrasing tool that helps students and writers to rephrase essays assignments articles Our paraphraser is 100 free



Reformulation de texte Outil Gratuit - Small SEO Tools

Ce paraphrase outil peut reformuler phrase et paragraphes en quelques secondes grammarly popup Une écriture brillante vous attend Grande écriture simplifiée



Outil de paraphrase(Reformuler un texte) - Pre Post SEO

Un outil de paraphrase est utilisé pour reformuler des phrases tout en conservant le sens original Cet outil de reformulation de texte fonctionne avec l'IA 



Free Paraphrasing Tool Rephrase Text Fluently with AI - Scribbr

Paraphrase text online for free and with no sign-up Upload any Microsoft Word document Google Doc or PDF into the paraphrasing tool



Reformuler Un Texte - Réécrire ou Reformulation de Phrase

Paraphraser c'est écrire une idée dans vos propres mots qui a été présentée ou expliquée par une autre personne plus tôt La paraphrase permet à une personne d 

  • Quel est le meilleur site pour paraphraser ?

    Rephrase.info est l'outil idéal pour créer des articles uniques. Il permet de reformuler des phrases en gardant le sens d'origine. Gr? à ce logiciel, il est possible de réécrire des essais uniques tout en étant précis. L'outil de paraphrase ne garde que les informations appropriées en supprimant les remplissages.
  • Comment paraphraser un texte en ligne ?

    Comment paraphraser en ligne ?

    1Tapez le texte dans la zone de saisie ou téléchargez un fichier.2Choisissez un mode de reformulation. Notre générateur de paraphrase a quatre modes : Aisance. Standard. Créative. Plus intelligent.3Cliquez sur le bouton soumettre et laissez cet outil de paraphrase faire le reste du travail. :)
  • Comment paraphraser sans plagier ?

    Comment éviter le plagiat en utilisant la paraphrase ?

    1« D'après cet auteur, … »2« Ainsi, selon cet auteur, … »3« Si nous reprenons les propos de l'auteur… »4« Cela revient à dire que… »5« Autrement dit… »6« Cet auteur semble dire que … »7« En d'autres termes… »8« Si l'on en croit les dires de cet auteur, … »
  • Pour paraphraser une source, il faut réécrire le passage sans modifier le sens du texte original. La paraphrase est une alternative à la citation (où vous copiez-collez les mots exacts de quelqu'un et les mettez entre guillemets).
Lexical Query Paraphrasing for Document Retrieval

IngridZukerman

SchoolofComputerScienceandSoftwareEng.

MonashUniversity

Clayton,VICTORIA3800

AUSTRALIABhavaniRaskutti

TelstraResearchLaboratories

770BlackburnRoad

Clayton,VICTORIA3168

AUSTRALIA

Abstract

Wedescribeamechanismforthegenerationof

thenusedtoranktheparaphrases.Weevaluated ourmechanismusing404querieswhoseanswers resideintheLATimessubsetoftheTREC-9cor- pus.Therewasa14%improvementinperfor- mancewhenparaphraseswereusedfordocument retrieval.

1Introduction

paraphraseswillmatcharelevantdocument. onyms.WeuseWordNet(Milleretal.,1990)and asaresultofqueryparaphrasing. ourevaluationandconcludingremarks.

CouncilgrantDP0209565.2RelatedResearch

Thevocabularymis-matchbetweenuserqueries

queryexpansion.Twocommontechniquesfor queryexpansionareblindrelevancefeedback (Buckleyetal.,1995;Mitraetal.,1998)and wordsensedisambiguation(WSD)(Mihalceaand mentsusingaquerygivenbyauser,andthencon- documents.WSDoftenprecedesqueryexpansion usedautomaticallyconstructedthesauri.

Theimprovementsinretrievalperformancere-

portedin(Mitraetal.,1998)arecomparableto disambiguationerrors. theaboveapproachesinthattheexpansionofa phrases.LikeHarabagiuetal.(2001),weuse

WordNettoproposesynonymsforthewordsin

whichwordstoparaphrase.Incontrast,weuse trievalprocess.

3Resources

onymsforthewordsineachquerywasobtained fromWordNet(Milleretal.,1990).

TimesportionoftheNISTTextResearchCollec-

tion(//trec.nist.gov).Thiscorpuswassmall formedforthedocumentsintheLATimescollec- obtainedfromWordNet(Milleretal.,1990). timesitappearsinthecorpus.Thelemma-pair ???thenumberoftimes ???appearsbefore ???in afive-wordwindowinthecorpus(notcounting maintainsadifferententryforthelemmapair naryowingtodiskspacelimitations.

4ParaphrasingandRetrievalProcedure

thefollowingsteps:

1.Tokenize,tagandlemmatizethequery.

2.Generatesynonymsforeachcontentlemmain

thequery(stopwordsareignored). entsynonymcombinations,computeascorefor eachparaphrase,andranktheparaphrasesac- cordingtotheirscore.Thelemmatizedquery plusthe19topparaphrasesareretained.

Documentsarethenretrievedforthequeryand

strainthenumberofsynonymsgeneratedforit.

Inordertodeterminetheeffectoftagginger-

allythewrongtags,andranoursystemwithboth

4.2Proposingsynonymsforeachword

ThefollowingtypesofWordNetsynonymswere

generatedforeachcontentlemmainaquery: synonyms,attributes,pertainymsand seealsos(Milleretal.,1990).1Forexample, accordingtoWordNet,asynonymfor"high"is anddonotgeneratesynonymsforpropernounsor stopwords.

4.3Paraphrasingqueries

inturn,andproposesasynonymfromthosecol-

Thesearequerieswhereallthewordsexceptone

arestopwordsorclosed-classwords.

4.4Computingparaphrasescores

Thescoreofaparaphraseisbasedonhowcommon

wouldberepresentedbyPr ditionalprobabilitiesareoftenused,e.g., Pr theinteractionbetweenalemma ???andonlyone oftheresultsinmostcases. abilities).Forinstance,if ??appears10timesinthe corpusand ???appears4times, (where ?isanormalizingconstant).Incontrast, if ???appears200timesinthecorpusand ???ap- pears30times, contributeahigherscoretotheparaphrase.

Toaddresstheseproblems,weproposeusingthe

yields posedoflemmas Pr Pr frequencies,yielding Pr ??freq? where ?isanormalizingconstant.2Sincethiscon- wedropitfromconsideration,anduseonlythe

Thus,ourparaphrasescoringfunctionis

4.4.1Experimentalparameters

Whencalculatingthescoreofaparaphraseus-

freq theorderof ??and ??(asitappearsintheparaphrase) shouldbeenforced;and(2)howtohandle ??pairs asexperimentalparametersofthesystem. 2???? forcetheorderof ???whencalculatingfreq? isdeterminedbytheweight?orderasfollows: freq ?freq? (3) wherefreq ??????isthefrequencyofthelemma- pair ??????when ??isfollowedby ??.?order ?order ???countsequallytheorderinthepara- weightsof0,1and0.5for ?order(Section6).

Absentlemma-pairs.Whenalemma-pairisnot

66millionpairs-hadafrequencyof1butwere

thedictionary.

4.5Retrievingdocumentsforeachquery

inthatforeachquery ?,weadjustthescoresofthe paraphrasesof ?(obtainedfromEquation2).Our

1.Foreachparaphrase

??of?(? #para?),where ???isthelemmatizedquery: (a)Extractthecontentlemmasfrom contentlemmasinparaphrase (b)Foreachlemma,computeascoreforthere- trieveddocumentsusingastandardIRmea- sure,e.g.,TermFrequencyInverseDocument

Frequency(TFIDF)(SaltonandMcGill,

1983).Lettfidf

????bethescoreof document???retrievedforlemma ).Whenadocument???isretrieved bymorethanonelemmainaparaphrase ?,itsTFIDFscoresareadded,yieldingthe score ???tfidf????? ????.Thisscoreindi- cateshowwell ???matchesthelemmasin paraphrase ??.Inordertotakeintoaccount theplausibilityof ??,thisscoreismultiplied by ???-thescoreof ??obtainedfrom

Equation2.Thisyields

??,thescoreof document ???forparaphrase ?????tfidf????? ????(4)

2.Foreachdocument

???,addthescoresfromeach paraphrase(Equation4),yielding para?????? tfidf???? ????(5)

Anoutcomeofthismethodisthatlemmas

performance(Section6).

5SampleResults

obtainedwith ?order ??,AbsAdjDiv=10,and werenotfoundinthedictionary. spitetheirlowoverallscoreandabsentlemma- forthelemmasinthisquery.Thesecondquery onymfor"invent"and"video"asasynonymfor rectparaphrases.Thethirdqueryisanextreme exampleofthisbehaviour,whereWordNetsyn-

Score#AbsParaphrase

WhoistheGreekGodoftheSea?

9.20E+020whobethegreekgodofthesea?

6.90E+001whobethegreekgodoftheocean?

5.00E-011whobethegreecegodofthesea?

1.00E-022whobethegreecedeityofthesea?

1.00E-022whobethegreecedivinityofthesea?

1.00E-022whobethegreeceimmortalofthesea?

1.00E-022whobethegreeceidolofthesea?

8.00E-032whobethegreekdeityofthesea?

8.00E-032whobethegreekdivinityofthesea?

8.00E-032whobethegreekimmortalofthesea?

8.00E-032whobethegreekidolofthesea?

Whoinventedtelevision?

7.00E+000whoinventtelevision?

1.60E+010whomanufacturetelevision?

1.60E+010whomanufacturevideo?

1.10E+010whomanufacturetv?

9.00E+000whoinventtv?

2.00E+000whodevisetelevision?

2.00E+000whoforgetv?

1.00E-021whoinventvideo?

1.00E-021whoinventtelly?

1.00E-021whocontrivetelevision?

1.00E-021whocontrivetv?

WhenwasBabeRuthborn?

6.06E+030whenbebaberuthbear?

3.39E+040whenbebaberuthpay?

1.97E+040whenbebaberuthstand?

1.09E+040whenbebaberuthhold?

2.42E+030whenbebaberuthcarry?

1.21E+030whenbebaberuthhave?

4.24E+021whenbebaberuthsupport?

9.09E+011whenbebaberuthexpect?

6.06E+001whenbebaberuthbrook?

6.06E+001whenbebaberuthwear?

3.03E-012whenbebaberuthdeliver?

Howtallisthegiraffe?

4.00E+000howtallbethegiraffe?

2.00E+000howlargebethegiraffe?

2.00E+000howbigbethegiraffe?

2.00E+000howhighbethegiraffe?

1.00E-011howgrandiloquentbethegiraffe?

1.00E-011howmagniloquentbethegiraffe?

1.00E-011howimprobablebethegiraffe?

1.00E-011howmarvelousbethegiraffe?

6Evaluation

ontheTRECLATimescollection,usingTREC task,sinceourultimategoalistoanswerques- where1105documentswerejudgedrelevantto48 ofthe50TREC-6keyword-basedqueries.

Ourresultsshowthatqueryparaphrasingim-

Forthequestionansweringtask,underthesame

wereretrievedimprovedfrom169to182. factorsonretrievalperformance. paraphrasegeneration). versusautomaticPoStagging(Brill,1992), whichtaggedcorrectly84%ofthequeries. ?Out-of-orderweight(?order)-howmuchwe shouldtakeintoaccountthewordorderina intermediate). muchweshouldpenalizelemma-pairsthatare adjacentinthequerybutabsentfromthecorpus (samepenaltyasnon-adjacentabsentlemma- pairs,alittlehigher,alothigher). ?Querylength-howthenumberofwordsinthe queryaffectsretrievalperformance.

1paraphrase(Set1),thenthequeryplus2para-

phrases(Set2),andsoon,uptoamaximumof trievalenginefrom1to20documents.

6.1WordNetCo-locations

05101520290

295
300
305
310
315
320
325
330
335

Number of paraphrases

Total number of correct documents

Correct Documents Vs Number of Paraphrases

Col

ColScore

NoCol paraphrases(20retrieveddocuments)

NoColandColScore.UndertheColsetting,our

mechanismcheckedwhetheralemma-pairinthe onymssuchas"folate"and"vitaminm"forthe

ColScoreisahybridsetting,whereWordNetwas

phrases,butnotforgeneratingthem. asafunctionofthenumberofparaphrasesina set(from0to19).Thevaluesfortheotherfac- torswere: ?order=1,AbsAdjDiv=2,andmanually- trievedwhenonlythelemmatizedquerywassub- bersofparaphrases,whenthemaximumnumberof

051015200

50
100
150
200
250
300
350

Number of retrieved documents

Total number of correct documents

Correct Documents Vs Number of Retrieved Documents

NoPara

Col

ColScore

NoCol retrieveddocuments(maximumparaphrases) trievedperquery(from1to20).AsforFigure1, moredocumentsareretrieved. as"vitaminm"and"vitaminbc",whichappeared "bigleague"inonly3ofthe19paraphrases)en- restofourevaluation.6.2Taggingaccuracy

ThePoS-taggerincorrectlytagged64ofthe404

mis-taggingwhichhadthelargestimpactonthe asotherPoSandviceversain24cases,andthe hadanounmis-taggedasanotherPoS.

6.3Out-of-orderweight

weight, ?order(Equation3):1,0and0.5.The thequery"howmanydogspullasledintheIdi- theweightoftheorderedpairs. theirorderinthecorpus.Thus,whenanordered

6.4Penaltyforabsentadjacentlemma-pairs

foreachabsentadjacentlemma-pair. tainedforAbsAdjDiv=10.quotesdbs_dbs30.pdfusesText_36
[PDF] comment utiliser op-cit et ibid

[PDF] quand mettre un alinéa mémoire

[PDF] citer un cours

[PDF] citer ses sources

[PDF] svt 3eme l homme face aux micro organismes controle

[PDF] différence entre bactérie et virus svt

[PDF] la diversité des microorganismes

[PDF] comment la constitution protège les droits fondamentaux

[PDF] en france comment les droits de l'homme sont ils protégés

[PDF] garantie des droits définition

[PDF] garantie des droits fondamentaux

[PDF] quels sont les droits fondamentaux

[PDF] les acteurs transnationaux dans les relations internationales

[PDF] isonomie

[PDF] l'organisation de la défense nationale schéma