加载中…
个人资料
  • 博客等级:
  • 博客积分:
  • 博客访问:
  • 关注人气:
  • 获赠金笔:0支
  • 赠出金笔:0支
  • 荣誉徽章:
正文 字体大小:

网络和通讯:Flickr数据集(Flickr Data Set)

(2012-02-10 17:06:59)
标签:

数据集

e-mail

数据来源

数据信息

获取

杂谈

分类: 其他

获取请进:Flickr数据集(Flickr Data Set)

http://www.datatang.com/data/13785

 

1、数据信息:

Flickr Data Set

Abstract: A network data set crawled from Flickr. Both the contact network and selected group membership information are included.  

 

Number of Nodes:

80513

Number of Edges:

5899882

Missing Values?

no

 

Source:

Lei Tang*, Huan Liu*

* School of Computing, Informatics and Decision Systems Engineering, Arizona State University. E-mail: l.tang@asu.edu, huan.liu@asu.edu

Data Set Information:

4 files are included:

1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains
all the node ids used in the dataset

2. groups.csv
-- it's the file of all the groups. It contains all the group ids used in the dataset

3. edges.csv
-- this is the friendship network among the bloggers. The blogger's friends are represented using edges.
Since the network is symmetric, each edge is represented only once. Here is an example.

1,2

This means blogger with id "1" is friend with blogger id "2".

4. group-edges.csv
-- the user-group membership. In each line, the first entry represents user, and the 2nd entry is the group index.

If you need to know more details, please check the relevant papers and code:
http://www.public.asu.edu/~ltang9/social_dimension.html

 

Attribute Information:

This is the data set crawled from Flickr ( http://www.Flickr.com ). Flickr is an image hosting and video hosting website, web services suite, and online community.
This contains the friendship network crawled and group memberships. For easier understanding, all the contents are organized in CSV file format.

-. Basic statistics
Number of users : 80,513
Number of friendship pairs: 5,899,882
Number of groups: 195

Relevant Papers:

1. Lei Tang and Huan Liu. Relational Learning via Latent Social Dimensions. In Proceedings of The 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD’09), Pages 817–826, 2009.

2. Lei Tang and Huan Liu. Scalable Learning of Collective Behavior based on Sparse Social Dimensions. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM’09), 2009.

Citation Request:

Please refer to the Social Computing Data Repository's citation policy

http://image.datatang.com/data/2012/2/10/20120210133308356.jpgData Set)" />

http://image.datatang.com/data/2012/2/10/20120210133326482.jpgData Set)" />

http://image.datatang.com/data/2012/2/10/20120210133341076.jpgData Set)" />

2、数据大小:14.96M

3、数据来源:

Lei Tang*, Huan Liu*
* School of Computing, Informatics and Decision Systems Engineering, Arizona State University. E-mail: l.tang@asu.edu, huan.liu@asu.edu

0

阅读 收藏 喜欢 打印举报/Report
  

新浪BLOG意见反馈留言板 欢迎批评指正

新浪简介 | About Sina | 广告服务 | 联系我们 | 招聘信息 | 网站律师 | SINA English | 产品答疑

新浪公司 版权所有