Image Tag Clarity [Dataset and Experimental Results]
The dataset used for computing the Image Tag Clarity is available online at NUS-Wide.
The normalized image tag clarity scores for the 5981 most popular tags are available in Excel and zipped files. Note that the values reported here may differ slightly from those reported in the paper, because a different number of dummy tags was used to estimate the expected image clarity scores: the mean and standard deviation of the image clarity scores for a given tag frequency are estimated here from 500 dummy tags, whereas the values reported in the paper are estimated from 100 dummy tags.
If you have any comments regarding the paper or the experimental results, please drop me an email at axsun AT ntu DOT edu DOT sg. Please cite the following paper if you use the above results in your work (e.g., filtering tags by visual concepts, visual representativeness, or others):
- Aixin Sun and Sourav S. Bhowmick. Image Tag Clarity: In Search of Visual-Representative Tags for Social Images. In Proc. of the 1st ACM SIGMM Workshop on Social Media (WSM09), in conj. with ACM MM, Beijing, China. Oct 2009.
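For readers who want to see how the dummy-tag statistics mentioned above could enter a normalized score, here is a minimal sketch. It assumes the normalization is a zero-mean (z-score) transform of a tag's raw clarity against the mean and standard deviation of dummy-tag scores at the same tag frequency; this page does not spell out the formula, so treat that as an assumption. The function name `normalized_clarity`, its parameters, and the sample numbers are hypothetical and are not taken from the released files.

```python
from statistics import mean, stdev


def normalized_clarity(raw_score, dummy_scores):
    """Return the z-score of raw_score against dummy-tag clarity scores.

    ASSUMPTION: normalization is (raw - mean) / std, where mean and std
    come from dummy tags with the same frequency as the real tag.
    The sample standard deviation is used here; that choice is also an
    assumption.
    """
    if len(dummy_scores) < 2:
        raise ValueError("need at least two dummy-tag scores")
    mu = mean(dummy_scores)
    sigma = stdev(dummy_scores)
    if sigma == 0:
        raise ValueError("dummy-tag scores have zero variance")
    return (raw_score - mu) / sigma


# Made-up numbers for illustration only (not real clarity scores).
example = normalized_clarity(4.0, [1.0, 2.0, 3.0])
```

Under this reading, a larger pool of dummy tags (500 instead of 100) mainly gives steadier estimates of the mean and standard deviation, which would be consistent with the small differences from the paper's values noted above.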
UnitSet
UnitSet is the dataset used in the Web Unit Mining project. It was created from the WebKB dataset, which is available from the Web->KB project. Please cite either of the following two papers if you use UnitSet in your experiments:
- Aixin Sun and Ee-Peng Lim. Web Unit Based Mining of Homepage Relationships. Journal of the American Society for Information Science and Technology (JASIST), 57(3):394-407. February 2006.
- Aixin Sun and Ee-Peng Lim. Web Unit Mining: Finding and Classifying Subgraphs of Web Pages. In Proc. of the 12th ACM International Conference on Information and Knowledge Management (CIKM 2003), pp. 108-115, New Orleans, LA, USA. Nov. 2003.