Скачать презентацию Image Information Retrieval Shaw-Ming Yang IST 497 E Скачать презентацию Image Information Retrieval Shaw-Ming Yang IST 497 E

5952add9f342b3219a7045de3a6a4368.ppt

  • Количество слайдов: 22

Image Information Retrieval Shaw-Ming Yang IST 497 E 12/05/02 Image Information Retrieval Shaw-Ming Yang IST 497 E 12/05/02

Overview n J. R. Smith and S. -F. Chang, Visually Searching the Web for Overview n J. R. Smith and S. -F. Chang, Visually Searching the Web for Content, IEEE Multimedia. July - Sept, 1997, Vol. 4, No. 3, pp. 1220; part of paper also in Columbia University CTR Technical Report # 45996 -25, 1996. n Introduction: 5 -min Recap Webseek n ¨ ¨ n What is Webseek? Data Collection Process Subject Classification Process Search and Retrieval Process Conclusion

5 -min Recap n Why is Image IR important? ¨ “a picture is worth 5 -min Recap n Why is Image IR important? ¨ “a picture is worth a 1000 words” ¨ Alternative form of communication ¨ Not everything can be described in text; Not everything can be described in images ¨ Popular medium of information on the Internet

5 -min Recap n Visual Feature Extraction ¨ What is the best way to 5 -min Recap n Visual Feature Extraction ¨ What is the best way to represent all data contained in an image empirically? n Multi-Dimensional Indexing ¨ What is the best way to scale large size image collections? n Retrieval System Design

What is Webseek? n n “Web. SEEK is a Content- Based Image and Video What is Webseek? n n “Web. SEEK is a Content- Based Image and Video Search and Catalog Tool for the Web. Search through more than 650, 000 images and videos. ” (Advent Project) Developed by The Advent Project at Columbia University ¨ Founded 1995 ¨ Foster industrial collaboration between researchers and media technology

What is Webseek? n More Specifically… Uses multiple agents to automatically analyze, index, and What is Webseek? n More Specifically… Uses multiple agents to automatically analyze, index, and assign images/videos to subject classes ¨ Uses both visual content and text for cataloging and searching ¨ n Features ¨ ¨ ¨ ¨ Searching using image content-based techniques Query modification using content-based relevance feedback Automated collection of visual information Compact presentation of images and videos for displaying query results Image and video subject search and navigation Text-based searching Search results lists manipulations n n intersection, subtraction and concatenation. http: //www. ctr. columbia. edu/webseek

Data Collection Process n Autonomous “spiders” ¨ Traversal Spider – “assembles lists of candidate Data Collection Process n Autonomous “spiders” ¨ Traversal Spider – “assembles lists of candidate Web documents that may include images, videos, or hyperlinks to them” ¨ Hyperlink Parser – “which extracts the Web addresses of images and videos” ¨ Content Spider – “which retrieves, analyzes, and iconifies the images and videos”

Data Collection Process Data Collection Process

Data Collection Process Data Collection Process

Data Collection Process n Content Spider Functions ¨ “extracts visual features that allow for Data Collection Process n Content Spider Functions ¨ “extracts visual features that allow for content-based searching, browsing, and grouping” ¨ “extracts other attributes such as width, height, number of frames, type of visual data” n Color histogram ¨ “generates an icon, or motion icon, which sufficiently compacts and represents the visual information to be used for browsing and displaying query results” n Compression algorithms

Subject Classification Process n “text provides clues about the semantic content of visual information” Subject Classification Process n “text provides clues about the semantic content of visual information” ¨ URL ¨ File n name Text clues can be found in HTML syntax ¨ [alt ¨ [hyperlink text]

Subject Classification Process n Term extraction Extracted from URLs, alt tags, hyperlink text by Subject Classification Process n Term extraction Extracted from URLs, alt tags, hyperlink text by removing nonalpha characters ¨ Fkey (URL) = Fchop (“animals/domestic-beasts 1/dog 37”) = “animals, ” “domestic, ” “beasts, ” “dog. ” ¨ n Dictionary name extraction ¨ n Fdir (URL) = “animals/domestic-beasts. ” Key-term dictionary Terms and Dictionary names are used to create t*k terms ¨ t*k terms identified semantically related to subject classes sm ¨ Mkm: t*k sm ¨

Subject Classification Process Subject Classification Process

Search and Retrieval Process n Search results list manipulation ¨A = Query (Term = Search and Retrieval Process n Search results list manipulation ¨A = Query (Term = “sunset”) ¨ Returns Query A results ¨ Select Query B from Query A results ¨ B = Query (Term = “nature”) ¨ C = A ∩ B = Query (Term = “sunset” and Term = “nature”)

Search and Retrieval Process Search and Retrieval Process

Search and Retrieval Process Search and Retrieval Process

Search and Retrieval Process n Content-based Techniques ¨ Color n n histograms dissimilarity “determines Search and Retrieval Process n Content-based Techniques ¨ Color n n histograms dissimilarity “determines the color dissimilarity between a query image and a target image. ” Indexes images by global color ¨ Integrated n n spatial and color query “users can graphically construct a query by placing color regions on a query grid” Analyzes “sizes, spatial locations, and relationships of color regions within the images”

Search and Retrieval Process Search and Retrieval Process

Conclusion n n Many similarities exist between traditional text-based IR systems and content-based IR Conclusion n n Many similarities exist between traditional text-based IR systems and content-based IR systems Although text-based IR systems have been relatively successful, CBIR still has many barriers to overcome. Visual Feature Extraction ¨ Multi-Dimensional Indexing ¨ Retrieval System Design ¨ n Although the discussed article is 5 years old, recent studies show Image IR research and development trends continue to focus on similar issues.

Conclusion n Future research directions ¨ Involving humans ¨ Identify high-level features n in Conclusion n Future research directions ¨ Involving humans ¨ Identify high-level features n in the R&D process concepts with low-level visual Finger print, human face ¨ Web oriented ¨ High dimensional indexing ¨ Performance evaluation criterion and standard ¨ Integration of Database and Computer Vision resources testbed

Bibliography n n n (2002) Image Retrieval. Retrieved: November 6, 2002, from: http: //140. Bibliography n n n (2002) Image Retrieval. Retrieved: November 6, 2002, from: http: //140. 120. 7. 1/~yloug/images/Image_Retrieval. PDF. Feng, X. (2002) Content-based Image Retrieval. Retrieved: November 6, 2002, from: http: //www. cse. ucsc. edu/classes/ee 264/Winter 02/xgfeng. ppt. J. R. Smith and S. -F. Chang, Visually Searching the Web for Content, IEEE Multimedia. July - Sept, 1997, Vol. 4, No. 3, pp. 12 -20; part of paper also in Columbia University CTR Technical Report # 45996 -25, 1996. Rui, Y. , Huang, T. S. , & Chang, S. F. (1999). Image Retrieval: Current Techniques, Promising Directions and Open Issues. Vaidya, D. (2002) Visual Information Retrieval. Retrieved: November 6, 2002, from: http: //www-isl. ece. arizona. edu/islpresentations/VISUAL%20 INFORMATION%20 RETRIEVAL. ppt Note: All images and quoted content are from J. R. Smith and S. -F. Chang’s Visually Searching the Web for Content.