Syracuse University ยท Data Lab

๐ŸŒ Livemocha

Livemocha is the world's largest online language learning community, offering free and paid online language courses in 35 languages to more than 6 million members from over 200 countries around the world.

104K
Nodes
2.2M
Edges
21.0
Avg Degree
No
Missing
Network Statistics
104K
Total Nodes
2.2M
Total Edges
21.0
Avg Degree
Social Network
Category
Size Relative to Repository Maximum
Nodes
104K
Edges
2.2M
Nodes & Edges โ€” Repository Comparison
Highlighted bar = this dataset. Logarithmic scale.
Edge-to-Node Ratio
Network density indicator
Dataset Details

Dataset Information

2 files are included:

1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains
all the node ids used in the dataset

2. edges.csv
-- this is the friendship network among the users. The friends are represented using edges.
Here is an example.

1,2

This means user with id "1" is friend with user id "2".

Attribute Information

This dataset contains the friendship network crawled from www.livemocha.com in December 2010 by Xia Hu (Ben) (). For easier understanding, all the contents are organized in CSV file format.

-. Basic statistics
Number of Nodes: 104,438
Number of Edges: 2,196,188
How to Cite
If you publish material based on data from this repository, please acknowledge the Data Lab Social Computing Data Repository at Syracuse University in your acknowledgements. This helps others find and replicate your work.

APA Format

R. Zafarani and H. Liu. (2026). Social Computing Data Repository [https://datasets.syr.edu]. Data Lab, Syracuse University.
@misc{Data Lab:SU,
  author       = {R. Zafarani and H. Liu},
  year         = {2026},
  title        = {Social Computing Data Repository},
  url          = {https://datasets.syr.edu},
  institution  = {Data Lab, Syracuse University}
}