A keyword partitioned index requires that the list of index entries for each keyword peer approach discussed in section 2 3 with a flexible parameter r However
Previous PDF | Next PDF |
[PDF] A computer program for creating keyword indexes to textual data files
This keyword index program can be used to create either KWIC or KWOC indexes parameter on the DATA DD card will also need to be changed to reflect the
[PDF] News-Oriented Automatic Chinese Keyword Indexing
Thus, automatically indexing keywords from text Keyword indexing can also be called keyword ex- traction date according to its feature values and choose
[PDF] A Keyword-Set Search System for Peer to Peer Networks - PDOS-MIT
A keyword partitioned index requires that the list of index entries for each keyword peer approach discussed in section 2 3 with a flexible parameter r However
[PDF] Spatial Keyword Query Processing - VLDB Endowment
This makes it difficult to determine which indexing technique best supports arguments, where q ψ is a set of keywords, q ρ is a spatial query point, and q k is
[PDF] Automatic indexing - CORE
In the experiment of this thesis the values were computed only to sin- index terms (or descriptors or keywords) are the elements of the indexing language
[PDF] Text Analysis and Automatic Indexing - SIGIR
tent identifiers, known as index terms, keywords, or descriptors, are repre ness of retrieval by using two parameters for each search known respectively
[PDF] CHAPTER 4 Indexing Procedures The function of indexing - SIGIR
It is assumed that all the keywords constitu- parameter determining precision in index performance parameters in indexing, exhaustivity and ^specificity
[PDF] Evaluating Spatial-Keyword Queries on Streaming Data - UCR CS
different spatial-keyword indexing techniques on streaming data, which is an essential index cell size, query weighting parameters, and number of Boolean
[PDF] Keywords and phrases: ima g e c lassific ation , topolo g ic al
[PDF] keywords of the alphabetical index - Venice Commission
[PDF] Keywords Rank Mobile Rank URL Last Check kfz reperatur graz 1 1
[PDF] Keyy yo optimis se la VoIP pour l`acc cès Intern Commu net
[PDF] Keyyo Business annonce le lancement de ses offres d`accès SDSL
[PDF] Keyyo Business lance la 1 offre de téléphonie SIP en marque blanche - France
[PDF] Keyyo conclut avec Ingram Micro un accord pour la distribution de
[PDF] Keyyo publie ses API de convergence téléphonie
[PDF] Keziah Jones (NG/UK)
[PDF] Keziah Jones - jazz à carthage 2009 - France
[PDF] Keziah Jones - Taj Express
[PDF] KEZO AMOUR LE PARFUM 50 ml VAPO PARFUM:75€ (BOTELLA - Anciens Et Réunions
[PDF] KF 1280 ND
[PDF] KF Entry List
AKeyword-SetSearchSystemforPeer-to-Peer
Networks
byOmprakashDGnawali
Science
attheMASSACHUSETTSINSTITUTEOFTECHNOLOGY
June2002
c inwholeorinpart.May28,2002
M.FransKaashoek
ProfessorofComputerScienceandEngineering
ThesisSupervisor
ArthurC.Smith
2 byOmprakashDGnawali
onMay28,2002,inpartialful¯llmentofthe requirementsforthedegreeofAbstract
ThesisSupervisor:M.FransKaashoek
3 4Acknowledgments
especiallyhissuggestionforexperiments. forbootstrappingmyeducation. 5 6Contents
1Introduction11
2RelatedWork17
3KSSDesignOverview25
74UsingKSS35
5SystemArchitecture43
6ExperimentalFindings49
87FutureWorkandConclusions61
9 10Chapter1
Introduction
algorithms. 11¯lesattractive.
nodes.1.1DistributedInvertedIndexing
12 thepositionfortheword. isunhelpfulinmakingaP2Psearche±cient. 131.2KSS:Keyword-SetSearchSystem
queryaredoc1anddoc2. 14 requirenosigni¯cantcommunication. canbeusedfor¯letransfers.1.3Contribution
1.4ThesisOverview
15 futureworkinchapter7. 16Chapter2
RelatedWork
2.1Napster
Client
Client
Client
Client
Metadata
uploadMetadata upload FileFile download
downloadQueryResponseNapsterMetadata
uploadServer
downloadedfrompeers. 172.2Gnutella
Peer Peer Peer Peer Peer Peer PeerQueryQuery
QueryQueryQuery
QueryResponse
ResponseDownload
18 highbandwidth.2.3FastTrackP2PStack
19Supernode
Supernode
Peer1Peer2Peer3
Supernode
Query: File2b
MetadatauploadMetadata
uploadPeer2: File2a, File2bPeer3: File3a, File3b
GET File File2b
File2a, File2bFile3a, File3b
Answer: Peer2
anddownloadsaredonefromthepeers resultsinworsequerylatency.Grokster[10]areFastTrackapplications.
2.4JXTASearch
20 inJXTA.2.5IterativeDeepeningandDirectedBFS
2.6LocalIndices
21withinD¡rhopsstillneedtobequeried.
2.7RoutingIndices
documentswiththeleastnumberofhops.2.8DiscoverQueryRouting
222.9DistributedJoinusingBloomFilters
Server sAServer sB
Client
(1) RequestA : 1 2 3 4B : 3 4 5 6(2) F(A)
1 2 3 4
3 4 6
(3) B F(A) U (4) A BU 3 4 N 2324
Chapter3
KSSDesignOverview
entriesarefetched. whenadocumentisinsertedintothenetwork. 25availabletothepeers. andChord. keywordsinthequery.
3.1KSSdistributedindex
oftheindex. 26Key hash(AB) hash(BC) hash(MN)URL http://18.175.6.2/a.mp3 http://18.175.6.2/a.mp3 http://18.175.6.2/b.html
Figure3-1:StructureoftheKSSindex
n-worddocumentisgivenbyC(n;k)=n! 273.2KSSExample
indextoreplytouserqueries. wordset:cosumed. N keyformedbyhashingABCD.
3.3Storingmetadataintheindexentry
29hash(AB)http://18.175.6.2/a.mp3 http://18.175.6.2/a.mp3hash(BC) hash(MN)http://18.175.6.2/b.htmlKey
X Y ZMetadata
X Y ZO P Q RURL
N matchingindexentries. 303.4KSSProperties
KSSsystem.
3.4.1InsertOverhead
asmallinsertoverhead.QueryOverhead
31them. q nextkwordsinthequery.Thus,thereareq khops,henceaboutqklistsneedtobe fetchedfromacrossthenetwork.
StorageOverhead
32entry.
KSSOverheadwithmetadata
regardlessofthequeryalgorithm. 3334
Chapter4
UsingKSS
adaptittospeci¯capplicationneeds.4.1Meta-dataSearch
ineachentry. onmorethantwokeywordsmoree±cient. 35thelistofentrieswiththematchingkeys. entriese±cient.
¯eldID,metaID):
word[0..n]=meta-datafield for(i=0;i¯lesystemsuchasCFS.
searchapplication.4.2Full-textindexing
documents. 37unlimitedw. threewords:A,BandC. orACthancontainanyofA,B,orC. 38
4.3KSSCon¯guration
4.3.1ContentsofIndexentry
39spaceblowupistoomuch.
4.3.3Storagerequirement
40forwardingthequerytoonenode.
4.3.4Documentranking
theonesappearinginthemiddleofaparagraph. 4142
Chapter5
SystemArchitecture
ChordDHashKSS
ChordDHashKSS
ChordDHashKSS
5.1TheChordLayer
43O(logN).
5.2TheDHashLayer
IDasthekey.
44ThefollowingisasummaryofDHashAPI:
key. anewentrywiththegivenkeyanddatavalue.5.2.1Availability
5.2.2Caching
fulllookup.5.2.3LoadBalance
45alotofbandwidthanddiskspacetospare.
5.3KSSLayer
query. entries,andstoretheminthenetwork. andreturntheintersectedlistofdocuments.5.4ImplementationStatus
supportsthefollowingcommands: 4647
48
Chapter6
ExperimentalFindings
preliminary¯ndingsfromourexperiments.6.1E±ciencyinFull-textsearch
6.1.1AnalysisMethodology
490 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
010002000300040005000
Percentage of pages
Number of index entries generated per page
Standard Inverted Index
KSS with window size of five
KSS with window size of ten
scheme. doc1)>. theactualusersontheLCSwebsite. 500 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
02004006008001000120014001600
Percentage of files
KB transferred during a query
Standard Inverted Index
KSS with window size 5
6.1.2InsertOverhead
510 50
100
150
200
250