Joint Optimization of Wrapper Generation and Template Detection  thumbnail
slide-image
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Joint Optimization of Wrapper Generation and Template Detection

Published on Sep 14, 20074071 Views

Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar pages by a common sc

Related categories

Chapter list

Joint Optimization of Wrapper Generation and Template Detection00:03
Outline00:39
Motivations01:05
Related Work02:05
Problems (cont.) (1)02:59
Problems (cont.) (2)03:13
Problems (1)03:45
Problems (2)04:10
Our Proposed Approach04:41
Problem Definition06:19
System Overview06:49
Wrapper Generation [6, 4, 18]07:42
Wrapper-DOM.Distance08:09
Wrapper-Oriented Page Clustering (WPC)08:45
Outline10:50
Experiments10:51
Effectiveness Test11:26
WPC with Different Thresholds13:11
Stability Test14:18
Demo!14:55
Conclusion18:08
Thanks!19:11