Search results for key=NgS2004 : 1 match found.

Refereed full papers (journals, book chapters, international conferences)

2004

Hieu T. Nguyen and Arnold Smeulders, Everything Gets Better All the Time, Apart from the Amount of Data, In Proceedings of the Third International Conference on Image and Video Retrieval (CIVR 2004), Dublin, Ireland, No. 3115 in Lecture Notes in Computer Science, pp. 33-41, Springer-Verlag, July 21-23 2004.

The paper first addresses the main issues in current content-based image retrieval to conclude that the largest factors of innovations are found in the large size of the datasets, the ability to segment an image softly, the interactive specification of the users wish, the sharpness and invariant capabilities of features, and the machine learning of concepts. Among these everything gets better every year apart from the need for annotation which gets worse with every increase in the dataset size. Therefore, we direct our attention to the question what fraction of images needs to be labeled to get an almost similar result compared to the case when all images would have been labeled by annotation? And, how can we design an interactive annotation scheme where we put up for annotation those images which are most informative in the definition of the concept (boundaries)? It appears that we have developed an random followed by a sequential annotation scheme which requires annotating 1% equal to 25 items in a dataset of 2500 faces and non-faces to yield an almost identical boundary of the face-concept compared to the situation where all images would have been labeled. This approach for this dataset has reduced the effort of annotation by 99%.