framework for location-aware search engineframework for location-aware search engine pasi fränti...
TRANSCRIPT
![Page 1: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/1.jpg)
Framework for location-aware search engine
Pasi Fränti17.1.2019
A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search engine", Journal of Location Based Services, 11 (1), 50-74, November 2017.
![Page 2: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/2.jpg)
Mopsi
![Page 3: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/3.jpg)
Mopsi overview
![Page 4: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/4.jpg)
Data collection in Mopsi
MOPSI webpage
wwwwww
Service directory
GPS
User collection
Other users:
Data collector:
Last skiing of winterN 62.63 E 29.86
User: Pasi
![Page 5: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/5.jpg)
Four aspects of relevance
Last skiing of winterDate: 4.4.2010Location: N 62.63 E 29.86
User: Pasi
• Text description• Keywords (tags)
• User profile• Social network
• Recency of data• Season (not relevant in July)
1. Content
2. Time
3. Location
4. User and his network
• Distance to user
Arppentie 5, Joensuu
P. Fränti, J. Chen, A. Tabarcea Four aspects of relevance in location-based media: content, time, location and network“ Int. Conf. on Web Information Systems & Technologies (WEBIST), 2011
![Page 6: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/6.jpg)
Mopsi search
![Page 7: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/7.jpg)
General workflow
.
.
.
User input Web mining Formatted output
Distance from user
meta search engine
![Page 8: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/8.jpg)
System architecturemeta search engine
Generic Search engine
![Page 9: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/9.jpg)
Location
![Page 10: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/10.jpg)
Location hierarchy
Country Finland
City Joensuu
Address Länsikatu 15, 80110
Location 62.59, 29.74
Geocoding Reverse geocoding
![Page 11: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/11.jpg)
Levels of location
Location 62.59, 29.74
Länsikatu 15 Science Park
Joensuu Finland
![Page 12: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/12.jpg)
Location in web pageAddress tag or geo-tag:<META name="geo.position" content="62.35; 29.44">
• <0.1% of Finnish websites used geo-tags in 2004 [Vänskä 2004]
• <1% of the websites related to the Oldenburg, Germany used explicit localization in 2008 [Ahlers and Boll, 2008]
• 7% of Mopsi service websites in May 2015
Postal address:• Most service websites have address
![Page 13: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/13.jpg)
Parsing web page
![Page 14: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/14.jpg)
Content of Web Page Hypertext Markup Language (HTML, XHTML)
Logo image
Navigation bar
Title
ImagesKeywords
Text
![Page 15: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/15.jpg)
blue links
<A>
red tables
<TABLE> <TR> <TD>
green dividers
<DIV>
violet images
<IMG>
yellow
forms
<FORM> <INPUT> … orange
linebreaks
<BR> <P>
blockquotes <BLOCKQUOTE>
black the root node
<HTML>
gray All other tags
DOM tree
![Page 16: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/16.jpg)
<html>
<body>
<table> <td>
<tr> <div>
<table>
<tr><td>PizzaPojat Niinivaara
Niinivaarantie 19
80200 Joensuu013 ‐
137 017
<br/>
<div>
<table align="center“><tr><td><div id="footerleft"><h3>PizzaPojat Niinivaara</h3><p>Niinivaarantie 19</p><p>80200 Joensuu</p><br /><p>013 ‐
137 017</p>
</div><td>
</tr></table>
Another example of DOM tree
![Page 17: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/17.jpg)
Web site functionality
![Page 18: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/18.jpg)
Single service
![Page 19: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/19.jpg)
Service directory
MultipleServices
![Page 20: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/20.jpg)
Bosbor kebab
Fiesta
Miami
Structure in the DOM tree
![Page 21: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/21.jpg)
Detecting function of the web page
Search engine
Pre-filter Discard Non-service
Service
Website Classifier
Single service Brand Service directory
Www
N. Gali, R. Mariescu-Istodor and P. Fränti, "Functional Classification of Websites" Int. Symposium on Information and Communication Technology (SoICT), Nha Trang, Vietnam, 34-41, December 2017
![Page 22: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/22.jpg)
Address detection:
![Page 23: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/23.jpg)
Address detection
Addresses
![Page 24: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/24.jpg)
DOM tree with address
![Page 25: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/25.jpg)
Detecting address from web• Analysis of text content of web page• Matching strings with address database• Address database stored as prefix tree• Both street number and postal code required
![Page 26: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/26.jpg)
Source of addresses in Mopsi
• Gazetteer for Finland• OpenStreetMap address data for the rest of world
![Page 27: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/27.jpg)
Address matching using Gazetteer
Kaislakatu 8, 80130, Kanervala, Joensuu, FinlandTorikatu 25, 80100 Joensuu, FinlandParppeintie 6, 82900 Ilomantsi, FinlandAleksanterinkatu 25, 15140 Lahti, FinlandVene 18, 10140 Tallinn, EstoniaCarrer de la Marina, 266-270, Barcelona, Spain2 Rue Pasteur, 06500 Menton, FrancePulchowk Rd, Lalitpur 44600, Nepal20 Chả
Cá, Hàng Đào, Hoan Kiem District, Hanoi, Vietnam
East Coast Park Service Road 1, Singapore
![Page 28: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/28.jpg)
Statistics of prefix trees
![Page 29: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/29.jpg)
Result of address detection
![Page 30: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/30.jpg)
Title extraction:
![Page 31: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/31.jpg)
N. Gali, R. Mariescu-Istodor and P. Fränti, "Using linguistic features to automatically extract web page title", Expert Systems with Applications, 79, 296-312, 2017.
N. Gali and P. Fränti, "Content-based title extraction from web page", Int. Conf. on Web Information Systems & Technologies (WEBIST'16), Vol.2, 204-210, Rome, Italy, April 2016.
Two methods
Method A: Title Tag Analyzer (TTA)
Method B: Titler
![Page 32: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/32.jpg)
Web Page Title
• Title Tag (91 %)
• Logo image (89 %)
• Web page body (93 %)
<title>Wentworth House Hotel Bath Hotels - Cheap Hotels in Bath, Somerset, UK</title>
The title can be in three different places:
![Page 33: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/33.jpg)
Title and Meta Tags
The obvious source
But includes also additional information
<title> Piato Restaurant – 123 Blues Point Road, McMahons Point, Sydney | Visit Piato and experience the life & flavour of Europe. North Sydney Functions. North Sydney Restaurants.</title>
<title> Joensuu Keskusta | Intersport - Sport to the people </title>
Segmentation is needed! Joensuu KeskustaIntersportSport to the people
![Page 34: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/34.jpg)
The coronet
Extract title & meta tags from the page
Segment content by delimiters
Construct candidate list
Score candidate segments
Web page
1. Placement in title & meta tags
2. Popularity in header tags3. Position in the web link
Title
Workflow of method AN. Gali and P. Fränti, "Content-based title extraction from web page", Int. Conf. on Web Information Systems & Technologies (WEBIST'16), Vol.2, 204-210, Rome, Italy, April 2016.
![Page 35: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/35.jpg)
Qualitative Analysis of TTATitle Ground truth Content of Title tag Selected string
Correct 3 Weeds Hotel 3 Weeds Hotel | Unique Pub | Bars | Restaurant | Party Venue | Inner West Sydney
3 Weeds Hotel
Short Irish Channel Restaurant & Pub
Irish Channel - Restaurant & Pub | 500 H St NW DC (202) 216-0046
Irish Channel
Long Secret Garden Bed & Breakfast
Secret Garden Bed & Breakfast (formerly Whitegates Guest House), near Keynsham, Bristol: Rooms, Prices and Guest Information
Secret Garden Bed & Breakfast (formerly Whitegates Guest House)
No title Rio Pool Hot Tubs, hot tub hire, swimming pools, Bristol, Gloucester
swimming pools
Incorrect Slice and Dice Home | Prepared Food | Swansea | Slice and Dice UK
Swansea
![Page 36: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/36.jpg)
MethodRouge-1
Jaccard DicePrecision Recall F-score
Baseline (Title Tag) 0.71 0.33 0.41 0.44 0.54TitleFinder (Moham.et al. 2012) 0.35 0.47 0.37 0.37 0.43Styling (Changuel et al. 2009) 0.14 0.21 0.15 0.22 0.28TTA (Gali and Fränti 2016) 0.52 0.59 0.52 0.54 0.62
Results with Mopsi ServicesAnnotated titles
![Page 37: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/37.jpg)
Workflow of method BN. Gali, R. Mariescu-Istodor and P. Fränti, "Using linguistic features to automatically extract web page title", Expert Systems with Applications, 79, 296-312, 2017.
![Page 38: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/38.jpg)
Content of text nodes N-grams (n=1…6) Filter by part-of-speech (POS) patterns
Representative title
![Page 39: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/39.jpg)
Navigation
Feeling Social? Find us on
Sydney Waterfront Restaurant Restaurant Milsons Point
Aqua Dining offers a quintessential Sydney dining experience with unrivalled harbour views that sweep from Luna Park to the world famous Sydney Harbour Bridge and the Sydney Opera House.
NNP
NNP
NNP
NNP NNP NNP NNP NNPS NN
NNP NNP VBZ DT JJ NNP NN NN
NNJJIN NNS WDT NN IN NNP NNP IN DT
NN JJ NNP NNP NNP DTCC NNP NNP
NNP
VBG VB PRP IN
POS tagging of phrases
NNP=Proper noun, singular NNPS=Proper noun, pluralNN=Noun, singular or massVBG=Verb, gerundVB=Verb, base formPRP=Personal pronounDT=DeterminerCC=Coordinating conjunction JJ=Adjective
![Page 40: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/40.jpg)
Comparison Mopsi services
Method A
Method B
![Page 41: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/41.jpg)
What about logo images?
~89 % of web pages have their title within a logo image
Needs to detect logo image
Apply OCR
Challenging !!!
![Page 42: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/42.jpg)
Representative image:
N. Gali, A. Tabarcea, and P. Fränti, "Extracting representative image from web page", Int. Conf. on Web Information Systems & Technologies (WEBIST'15), 411-419
Lisbon, Portugal, May 2015.
![Page 43: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/43.jpg)
Banner
Logo
Formatting
Representative
Icons Advertisement
Image categories
![Page 44: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/44.jpg)
Extract images
Web page link
Categorize
Analyze
Rank
Representative image
Images found:
Web page
Overall extraction process
![Page 45: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/45.jpg)
src http://www.ravintolakreeta.fi///images/banner.jpg
alt --title --from cssformat jpgwidth 945height 202size 190,890 pxaspect ratio 4.67parent tag <div>class header
Image features used
![Page 46: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/46.jpg)
Category Features KeywordsRepresentative Not in other category
Logo logoBanner Ratio > 1.8 Banner, header,
Footer, buttonAdvertisement Free, adserver, now,
buy, join, click, affiliate, adv, hits, counter
Formatting and Icons Width < 100 pxHeight < 100 px
Background, bg, spirit, templates
Summary of the rules
![Page 47: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/47.jpg)
Mopsi WebIma dataset
Summary of data collected:
Websites: 1002Images: 2363 Per page: Min=1, Average=2.36, Max=154Collection details:Who: 117 volunteersWhen: September 2014What: Pages of own choice or Mopsi searchHow: Select 1-3 most representative imagesIssues: Some level of subjectivity unavoidable
http://cs.uef.fi/mopsi/data/
![Page 48: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/48.jpg)
Results summary
Accuracy Extracted Images
WebIma 64% 99%
Google+ 48% 92%
Facebook 39% 90%
• Lightweight method suitable for real time applications
• Unsupervised: No training, no user feedback needed
• In use in MOPSI: Search and Service upgrade
![Page 49: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/49.jpg)
Recommendation system
![Page 50: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/50.jpg)
Mopsi search
Keyword search Recommendation(no keywords)
User location
![Page 51: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/51.jpg)
Location-aware recommendation
Results
Press here
Location
Input:• User• Location• Time• Keyword (optional)
Recommendations:• Nearby services• Photos of other users
![Page 52: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/52.jpg)
Industrial zoneRahkeentie Kuurnankulma
740 m
306 m762 m
Kuurnankulma
Vilkku kahvio
Vilkku kahvio
Heinosen leipomo
![Page 53: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/53.jpg)
K. Waga, A. Tabarcea and P. Fränti, "Recommendation of points of interest from user generated data collection", IEEE Int. Conf. on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom'12), Pittsburgh, USA, 2012.
Solutions for recommendation
Recommendation:• User statistics• Location• Time
User network:• Similarity of users• Local knowledge
P. Fränti, K. Waga, and C. Khurana, "Can social network be used for location-aware recommendation?", Int. Conf. on Web Information Systems & Technologies (WEBIST'15), 558-565, Lisbon, Portugal, May 2015.
![Page 54: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/54.jpg)
Conclusions
Key challenges:• Detecting location and text summary
Is it effective?• 40% of websites contain useful location
When it works?• GOOD: Service web page• NOT SO GOOD: Blogs, news stories…
![Page 55: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/55.jpg)
1. A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search engine", Journal of Location Based Services, 11 (1), 50-74, November 2017.
2. N. Gali, R. Mariescu-Istodor and P. Fränti, "Using linguistic features to automatically extract web page title", Expert Systems with Applications, 79, 296-312, 2017.
3. N. Gali, R. Mariescu-Istodor and P. Fränti, "Functional Classification of Websites“, Int. Symposium on Information and Communication Technology (SoICT), Nha Trang, Vietnam, 34-41, December 2017
4. N. Gali, R. Mariescu-Istodor and P. Fränti, "Similarity measures for title matching", IAPR Int. Conf. on Pattern Recognition, (ICPR'16), Cancun, Mexico, 1549-1554, December 2016.
5. N. Gali and P. Fränti, "Content-based title extraction from web page" , Int. Conf. on Web Information Systems and Technologies (WEBIST 2016), Rome, Italy, vol. 2, 204-210, April 2016.
6. M. Rezaei, N. Gali, and P. Fränti, "ClRank:a method for keyword extraction from web pages using clustering and distribution of nouns", IEEE/WIC/ACM Int. Joint Conf. on Web Intelligence and Intelligent Agent Technology (WI- IAT), 79-84, December 2015.
7. P. Fränti, K. Waga, and C. Khurana, "Can social network be used for location-aware recommendation", Int. Conf. on Web Information Systems & Technologies (WEBIST'15), 558-565, 2015.
8. N. Gali, A. Tabarcea, and P. Fränti, "Extracting representative image from web page", Int. Conf. on Web Information Systems & Technologies (WEBIST'15), 411-419, 2015
9. K. Waga, A. Tabarcea and P. Fränti, "Recommendation of points of interest from user generated data collection", IEEE Int. Conf. on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom'12), Pittsburgh, USA, 2012.
10. P. Fränti, J. Chen, A. Tabarcea, Four aspects of relevance in location-based media: content, time, location and network“ Int. Conf. on Web Information Systems & Technologies (WEBIST), 2011
11. A. Tabarcea, V. Hautamäki, P. Fränti, "Ad-hoc georeferencing of web-pages using street-name prefix trees", Int. Conf. on Web Information Systems & Technologies (WEBIST'10), Valencia, Spain, vol.1, 237-244, April 2010.
Publications
![Page 56: Framework for location-aware search engineFramework for location-aware search engine Pasi Fränti 17.1.2019 A. Tabarcea, N. Gali and P. Fränti, "Framework for location-aware search](https://reader033.vdocument.in/reader033/viewer/2022042712/5f95bc9c44ad8d30c6349412/html5/thumbnails/56.jpg)
1. Radu Mariescu-Istodor, “Efficient management and search of GPS routes”, PhD thesis, School of computing, Univ. Eastern Finland, August 2017.
2. Najlaa Gali, “Summarizing the content of web pages”, PhD thesis, School of computing, Univ. Eastern Finland, June 2017.
3. Mohammad Rezaei, “Clustering validation”, PhD thesis, School of computing, Univ. Eastern Finland, June 2016.
4. Karol Waga, ”Processing, analysis and recommendation of location data”, PhD thesis, School of computing, Univ. Eastern Finland, June 2015.
5. Andrei Tabarcea, “Location-based web search and mobile applications”, PhD thesis, School of computing, Univ. Eastern Finland, 2014.
6. Minjie Chen, “Efficient processing and compression of map images and routes”, PhD thesis, School of computing, Univ. Eastern Finland, August 2012.
7. Qinpei Zhao, “Cluster validity in clustering methods”, PhD thesis, School of computing, Univ. Eastern Finland, June 2012.
PhD theses