With 40-70% of total Internet traffic, P2P is unquestionably the main factor in ISPs' technological and business considerations. Would "Managed Peer Selection" be a solution?
The high throughput of P2P file-sharing applications is often achieved by establishing multiple simultaneous connections with other peers sharing the same content. Popular content, such as pirated movies or TV shows, is usually shared by hundreds or even thousands of peers distributed across the globe. When activated, the P2P client establishes connection with 50-100 other peers, usually selected randomly from a large set of appropriate peers.
Due to the random peer selection strategy, most of the peer connections have to go out/in of a given ISP network, significantly increasing the transit cost paid by the ISP. For example, in South Korea and Japan, as much as 80% of broadband traffic leaving the country does so over under-sea cables, which results in high transit charges. A large portion of this traffic belongs to P2P and can be eliminated by interconnection of local peers.
Managed Peer Selection is an actively discussed technology to optimize P2P overlay networks in accordance with ISP network topology and business rules. This discussion was initiated by the P4P working group and is supported by IETF: an Application-Layer Traffic Optimization (ALTO) working group is about to be established. These efforts are supported by some P2P application vendors (BitTorrent, Pando Networks) as well as major ISPs (Comcast, Verizon, Telefonica).
To estimate a feasibility of ALTO/P4P aproaches for real life torrents , I collected <ip,port> information for peers from one of the most popular "PirateBay" torrents ( almost 20k peers) and mapped their IPs to corresponded ASs (Autonomous Systems - group of IPs that belong to the same ISP. ISPs usualy have one or more ASs).
For investigation I chose a torrent that represents bootlegged copy of "Hellboy 2" movie released July 11 , 2008 for screening in US and Canada, but not yet legally available in other countries.
Data collection had been performed August 16, 2008. Peers information had been collected by periodic "announce" polling of the ThePrateBay tracker.
"IP to geo" and "IP to AS" mapping had been performed using WHOIS service from whois.cymru.com .
To make a long story
short - for such kind of swarm (20k
peers):
For illustration, the following graph represents cumulative distribution of the peers as function of the AS rank.
(another interesting graph is here)
Speculation:
Usually, 10 "good neighboring peers " sufficient to provide BitTorent client with maximum download bandwidth. If P2P application manage to give priority to connections inside of their own AS and "localise" traffic inside of ISP (AS) segment, localisation will eliminate 70-80% of BitTorrent traffic on the global backbone.
Certainly, it is a BIG speculation - I intentionally chose an unusually big BitTorrent swarm to this investigation. It will be interesting to evaluate AS distribution for small and medium swarms and extrapolate to all known torrents.
Please find more details in my work notes. You also welcome to download raw data in "per-AS" aggregation. Any comments, addition, corrections, etc are VERY welcome!.
Comments
This is a really great
This is a really great protocol and idea for P2P downloading i monitor my connections while downloading and know when connected to IP's from the same ISP i get amazing transfer time and dont need any other connections they are actually a waste. To have this built in and download from the top 10 best connections would be good but there has to be something built in to wait for you to establish a good connection with everyone and begin to download before it starts to disconnect you from different peers. I know it can take a few minutes to start receiving from every peer at full speed. This would be good for internet cafe software who have p2p programs setup over seas where their bandwidth is really important.
Would Teleco's go for P4P?
It is an interesting observation you have made with respect to reduction in transit traffic using P4P (ie managed p2p). But since in the US the "tier1" ISPs (aka most large telecos) who have ~1/2 of the eyeball traffic would not like the P4P arrangement. This leaves only the cable guys and small ISPs who would be interested since they pay transit. In that realistic scenario does P4P still make sense?