algorithm to find hidden links in a web page
TRANSCRIPT
[1]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Under the guidance of
Mr. Indraneel Mukhopadhyay
ALGORITHM TO FIND HIDDEN LINKS IN A WEB PAGE
Presented by
Pradyut Kumar MallickRoll # IT200127292
[2]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Introduction
Hidden links are ones that real people aren’t supposed to actually notice or click on
Hidden links is a way to guide a search engine to our doorway pages.
New dynamic “hidden link” technique for linking a large highly connected graph in a simple hyperbolic space without cluttering the display.
[3]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
A cyclic hyperbolic space with hidden links
[4]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
In a hyperbolic space, the far away nodes/edges (paths) are diminished when the user is not focusing on them.
The user can dynamically warp the display to focus on thousands of different nodes for navigation.
This graph is a non-cyclic hierarchical hyperbolic structure without multiple connected paths.
A cyclic hyperbolic space with hidden links
[5]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
New Technique
The user can easily navigate through all possible paths without tracing many lines and intersections
Robot programs called spiders create search engine databases, computer robot programs that crawl the web seeking search engine content
Pages created as the result of a search are called "dynamically generated" pages .
[6]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
In a directed non-cyclic hierarchical space, there is a primary graph, which links all the nodes in a tree form. These links are primary tree links. The others are non-tree/cross links in a highly connected graph. A node can have one incoming primary link and many non-tree/cross links.
Definitions
[7]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Definition of Cyclic Hierarchical Space
[8]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Primary Path: (tree-link) “AE”
Secondary Path (non-tree/cross link) “AB
Hidden-Link Node
Primary Sub-Space Nodes
Secondary Sub-Space Nodes
Placeholder
Definition of Cyclic Hierarchical Space
[9]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Hidden Link States and Processing Flow
State 1: Idle State
State 2: Activate State
[10]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Hidden Link States and Processing Flow
State 3: Map/Unmap (move) State
State 4: Navigation State
[11]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Hidden Link States and Processing Flow
State 5: Reset
[12]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
“Hidden Link” Client-Server Web Structure
[13]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Code
The basic link tag looks something like <a href="hidden.html">click here</a>.
<a href="hidden.html" style="cursor:help">
<a href="hidden.html" style="color:#FF0080">
<a href="hidden.html" style="text-decoration:none">
Cursor Type …………. auto ……………crosshair ……………hand
[14]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Build hash table of links in the website.
Partition web log by visitor
For each visitor, partition web log file such that each subsequence terminates in a target page.
For each visitor and target page, find any expected locations for that page:
Algorithm
[15]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Website & Search Pattern of Hidden Links
[16]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Hidden Link Applications
CONTENT AND USAGE MINING
CUSTOMER INTERVIEW WEB SERVICE
[17]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
<div id="Links0" style="LEFT:0px;TOP:0px;
VISIBILITY:hidden; POSITION: absolute;">
<a href="index1.htm">hasdf hdkfh afhkj </a>
<a href="index2.htm">kjhf haksf hkasf </a>
<a href="index3.htm">kjhkjdf khdkf haf</a>
<a href="index4.htm">ghdf gdjf kgdf</a>
Related Work
[18]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Conclusion
The hidden link technique enables the mining
of large hierarchies with multiple secondary
paths
Hidden link enables the user to easily navigate
through different links without being
overwhelmed with large member of nodes and
paths.
[19]
Nati
onal In
stit
ute
of
Sci
en
ce &
Tech
nolo
gy
Algorithm to Find Hidden Links
Pradyut Kumar Mallick
Thank You!!