• No results found

physics of the Web

N/A
N/A
Protected

Academic year: 2023

Share "physics of the Web"

Copied!
39
0
0

Loading.... (view fulltext now)

Full text

(1)

Physics of the

G Santhosh Kumar

Web

Cochin University

(2)

Birthday of a Giant

Whose slogan is this?

Stand on the shoulders of giant

(3)

Idea of PageRank

(4)

Anatomy of a Search Engine

Source:

http://infolab.stanford.edu/~backrub/google.html

(5)

Random Surfer model

The random surfer visits a web page with a certain probability which derives from the page's PageRank. The probability that the random surfer clicks on one link is solely given by the number of links on that page

the probability for the random surfer reaching one page is the sum of probabilities for the random surfer following links to this page

(6)

World’s largest

Eigen value Problem

Idea is to compute the

Principle Eigen vector of the system

(7)

Markov

matrix S is irreducible and

stochastic

S =

Google Matrix

The rank of each page can be generated iteratively

from the Google matrix using the power method

(8)

Google matrix of Wikipedia articles network, written in the bases of PageRank index; fragment of top 200 X 200 matrix elements is shown, total size N=3282257

People are interested in Spectrum and eigen states of G matrix

(9)

Towards Google matrix of …

Brain: The Google matrix G is constructed on the basis of neuronal network of a brain model DNA: Google Matrix Analysis of DNA Sequences

(10)
(11)

An old experiment

• Milgram in 1967

Any two strangers in the world are separated by an average of six

In 2008, a study by Microsoft showed that the average chain of contacts between users of its Messenger Service was 6.6 people

(12)

It’s Small World, after all

small diameter of the web means that all that information is just a few clicks away

(13)

Map of interacting

Proteins

(14)

Networks without scale

(15)

Random Graphs

Erdös-Rényi model (1960)

Connect with probability p

p=1/6 N=10

k ~ 1.5

(16)

Erdös-Rényi model

(17)

Erdös-Rényi model

(18)

World Wide Web

Over 3 billion documents ROBOT: collects all URL’s found in a document and follows them recursively

Nodes: WWW documents

Links: URL links

R. Albert, H. Jeong, A-L Barabasi, Nature, 401 130 (1999).

Expected

P(k) ~ k-γ Found

Scale-free NetworkExponential Network

Power Law

(19)

Barabási & Albert, Science 286, 509 (1999)

j j i i

k k k

= Σ Π( )

P(k) ~k-3

(1) Networks continuously expand by the addition of new nodes

WWW : addition of new documents

GROWTH:

add a new node with m links

PREFERENTIAL ATTACHMENT: the probability that a node connects to a node with k links is proportional to k.

(2) New nodes prefer to link to highly connected nodes.

WWW : linking to well known sites

Preferential attachment

Scale free networks

(20)

What about late comers?

Fitness model is model of the evolution of a network:

how the links between nodes change over time depends on the fitness of nodes. Fitter nodes attract more links at the expense of less fit nodes

(21)

Bose-Einstein Condensation in evolving networks

G. Bianconi and A.-L. Barabási, Physical Review Letters 2001; cond-mat/0011029

j j j

i i

i k

k η η

= Σ Network Π

η

) (η kin

) (η ρ

Bose gas

βε

e

1 ) 1

(ε = βε n e

) (ε g

Fit-gets-rich Bose-Einstein condensation

(22)

Robustness of Scale free networks

Complex systems maintain their basic functions even under errors and failures (cell mutations;

Internet router break)

node failure

fc

0 1

Fraction of removed nodes, f 1

S

(23)

Robustness of Scale free networks

Robustness case Attack case

(24)

Is a computer Intelligent?

Dr. Gautham Shroff: Course on Web Intelligence on coursera.org

(25)

Is a computer Intelligent?

Dr. Gautham Shroff: Course on Web Intelligence on coursera.org

(26)

Is a computer Intelligent?

Dr. Gautham Shroff: Course on Web Intelligence on coursera.org

(27)

Web Intelligence @ Web Scale AI is here

IBM Watson at Jeopardy 2011

Dr. Gautham Shroff: Course on Web Intelligence on coursera.org

(28)
(29)

Data Science?

Dr. Gautham Shroff: Course on Web Intelligence on coursera.org

(30)

Data Science?

(31)

Predicting Scientific Laws?

Eurequa : Already predicted fundamental equations

Patterns in data ...

(32)

facebook connection

(33)

Network Science?

• Watch this

(34)

• What is the dynamics of these network?

• How to control the complex network?

Network Science?

Principles shall be drawn from Control Theory

(35)

Inverse Problem

(36)

Dynamical Systems

• State variables: What is the number (min) of control points required to

drive the system?

• Linear systems: Kalaman Filter

• What about Non-linear systems?

(37)

Tail End

The 21st century," physicist Stephen Hawking has said, "will be the century of complexity."

Likewise, the physicist Heinz Pagels has said that "the nations and people who master the new sciences of

complexity will become the economic, cultural, and political superpowers of the 21st century."

(38)

References

Linked: How Everything Is Connected to Everything Else and What It Means for

Business, Science, and Everyday Life... by Albert-Laszlo Barabasi (Apr 29, 2003)

The Structure and Dynamics of Networks:

(Princeton Studies in Complexity) by Mark Newman, Albert-László Barabási and

Duncan J. Watts (Apr 17, 2006)

Bursts: The Hidden Patterns Behind

Everything We Do, from Your E-mail to

Bloody Crusades by Albert-Laszlo Barabasi (May 31, 2011)

(39)

Thank You

References

Related documents

SaLt MaRSheS The latest data indicates salt marshes may be unable to keep pace with sea-level rise and drown, transforming the coastal landscape and depriv- ing us of a

Although a refined source apportionment study is needed to quantify the contribution of each source to the pollution level, road transport stands out as a key source of PM 2.5

December 1978 Synopsis of marine prawn fishery of India for the second quarter of 1978 Crustacean Fishery Resources Team Experiment on polyculture in a brac- kish water fish farm

These gains in crop production are unprecedented which is why 5 million small farmers in India in 2008 elected to plant 7.6 million hectares of Bt cotton which

INDEPENDENT MONITORING BOARD | RECOMMENDED ACTION.. Rationale: Repeatedly, in field surveys, from front-line polio workers, and in meeting after meeting, it has become clear that

With respect to other government schemes, only 3.7 per cent of waste workers said that they were enrolled in ICDS, out of which 50 per cent could access it after lockdown, 11 per

Of those who have used the internet to access information and advice about health, the most trustworthy sources are considered to be the NHS website (81 per cent), charity

Harmonization of requirements of national legislation on international road transport, including requirements for vehicles and road infrastructure ..... Promoting the implementation