Dr. Aixin SUN

School of Computer Engineering, Nanyang Technological University, Singapore

  • Home
  • Publications
  • Projects
  • Teaching
  • Datasets
  • Contact

Image Tag Clarity [Dataset and Experimental Results]


The data set used for computing the Image Tag Clarity is available online at NUS-Wide. The normalized image tag clarity scores for the 5981 most popular tags are available in Excel and zipped files.  Note that, the values reported here might be slightly different from the values reported in the paper due to the different number of dummy tags used for estimating the expected image clarity scores. The mean/std of image clarity scores for a given frequency reported here are estimiated through 500 dummy tags (while the values reported in the paper are estimated through 100 dummy tags). Please cite the following paper if you use the above results in your work (e.g., filtering tags by visual concepts, visual representativeness, or others).  Please drop me an email at axsun AT ntu DOT edu DOT sg if you have any comments regarding the paper or the experimental results. 
  1. Aixin Sun and Sourav S. Bhowmick. Image Tag Clarity: In Search of Visual-Representative Tags for Social Images.  In Proc. of the 1st ACM SIGMM Workshop on Social Media (WSM09) in conj. with ACM MM, Beijing, China. Oct 2009.

UnitSet


The UnitSet is the one used for Web Unit Mining project. The dataset is created based on the WebKB dataset which is available at Web->KB project. Please cite any of the following two papers if you would like to use UnitSet in your experiments:
  1. Aixin Sun and Ee-Peng Lim, Web Unit Based Mining of Homepage Relationships, Journal of the American Society for Information Science and Technology (JASIST), 57(3):394-407. February 2006.
  2. Aixin Sun and Ee-Peng Lim, Web Unit Mining Finding and Classifying Subgraphs of Web Pages. In Proc. of 12th ACM International Conference on Information and Knowledge Management (CIKM 2003), pp. 108-115, New Orleans, LA, USA, Nov. 2003.
This is a personal page maintained by the author. The ideas and information expressed on it have not been approved or authorised by NTU either explicitly or impliedly.