Search Logger
Posts from: Administrator

Author Archive

Search the Web Through a Vertical Lens

7:00 pm - November 26, 2008 in Yahoo! Search Blog

Over the last few months we’ve showcased a handful of innovative mashups that developers have created using Yahoo! Search BOSS. The creations have ranged in functionality and focus, but all have included interesting applications of Yahoo! Search’s index, infrastructure and technology. Today, we’re sharing another useful application of BOSS that TechCrunch just launched - a technology-flavored Web search engine. This new search engine enables TechCrunch users to search for technology-focused articles and company information across both the TechCrunch network of sites and the rest of the Web.

This partnership with TechCrunch illustrates a new BOSS capability called vertical lens technology, which enables partners, often with no search expertise, to create a truly comprehensive vertical search engine that complements their core user experience. Using BOSS, TechCrunch now provides a one-stop shop for all types of search. BOSS not only improves TechCrunch’s standard site search, it also integrates relevant technology content on people, products and companies from across the Web. And if a user conducts a non-tech focused search, they’ll see relevant Web search results.

BOSS.TechCrunch

BOSS vertical lens technology provides a handful of features that were implemented on TechCrunch:

  • Real-time indexing of proprietary content
  • - Once TechCrunch content is added or a user comment is submitted, the search index almost immediately reflects the changes.

  • Customized ranking
  • - The BOSS and TechCrunch teams worked closely to fine-tune the ranking algorithm to fit the TechCrunch audience and user experience.

  • Structured search
  • - BOSS supports faceted refinement around TechCrunch’s structured content. This offers powerful tools for people who want to fine tune their searches or browse TechCrunch content - e.g. narrowing in on articles on Twitter written only by Michael Arrington, or only the most popular articles (popularity is determined by number of comments), or locating all companies that are founded in 2008.

  • Blending Web with proprietary content in a single search display
  • - After querying both the TechCrunch and the Web indices, BOSS blends the results to produce one result set that is both relevant and comprehensive.

BOSS vertical lens is currently available to certain Yahoo! partners - but we’re working to share the technology more openly through the BOSS API. If you are interested in building a search product using BOSS vertical lens technology, please email us here. Our partnership with TechCrunch highlights the beginning of many more BOSS partner launches as Yahoo! continues to open up its search technologies and infrastructure to the world.

Take a look at TechCrunch’s article for additional background and insight.

Stay tuned.

YaJie Ying
BOSS Team

 

Searching Through 2008 – A Look Back

5:46 pm - December 1, 2008 in Yahoo! Search Blog

It’s that time of year again when we look back at what was 2008 through the billions of search queries we receive each month and from the millions of users who help paint this picture. This year we’ve teamed up with Yahoo! Buzz to bring to you the Yahoo! 2008 Year in Review.

Not only are we presenting the top ten searches, we are also including overall themes and popular trends that bubbled to the surface , including the economy, politicians, news stories, Olympians, rising celebrities, influential women, and notable deaths. Searches within these themes covered the gamut, from Miley Cyrus’ controversial Vanity Fair cover to the notorious pregnant man and farewell searches of Heath Ledger and Bernie Mac. Michael Phelps also pulled in the highest searches of any Olympian, while Angelina Jolie and Sarah Palin gained top positions in the influential women category.

Additional “top tens” were also included in this year’s review across select areas of Yahoo! such as, Yahoo! Food, Shine, Tech, Green, Shopping, Travel, Games, Movies, Mobile, Music, Local, Upcoming, Hot Jobs, most Buzzed-Up stories and clicked-on stories from yahoo.com.

What a year it was! Looking over all of the top ten lists and search activity for 2008, we pulled out some interesting tidbits we thought you’d find interesting. Users were captivated by this year’s historic US Presidential election, with the Olympic triumphs closely following. The economic downturn was another issue that grabbed up a lot of user’s attention. Of course, there was still time to enjoy a little escape by devouring celebrity gossip and delving into online video games.

And despite all of the activity surrounding the 2008 Presidential elections, Britney Spears still gained the number one spot as the top searched person (7 out of 8 years). She did make some news this year with a new album, three MTV Music Awards, a sitcom guest star role and documentary on her rise. People just can’t get enough of her!

Barack Obama was the second most searched person in 2008 and the third most popular search term. Headlines of his election victory received more clicks than any other story on yahoo.com this year. Obama soared up the search ranks as the election campaign moved forward with consumers using search to get informed, learn more about him and decide how to vote in this critically important election.

There was some debate on whether or not Britney Spears could outrank President-elect Barack Obama as the most searched term and most searched person in 2008. According to a recent Yahoo! Search survey*, most people thought Barack Obama (38 percent) would be the most searched person on the Web followed by Britney Spears (27 percent). It’s not over yet, though! With one more month remaining in 2008, President Obama may still outrank his top search opponent.

To find out more about the Yahoo! 2008 Year in Review, go to yearinreview.yahoo.com.
Incidentally, another way to look back at 2008 is through our monthly reviews of top movers - keywords that accelerated the fastest. We skipped January (hangover, sorry), but click through to see what buzzed in February, March, April, May, June, July, August, September, October, and November.

Stephen Rodi
Yahoo! Search

*Source: October 2008 survey from Yahoo! and Decipher Inc., which randomly sampled and surveyed 2,000 U.S. Internet Adults

 

Yahoo! Keynotes at Gilbane Boston 2008

8:00 pm - December 2, 2008 in Yahoo! Search Blog

Yahoo! Search will be in Boston this week for the 5th Annual Gilbane Conference. This year’s theme is “Where Content Management Meets Social Media,” covering topics that range from text analytics and semantic technologies to social computing. Yahoo!’s Head of Research, Prabhakar Raghavan, will be giving an opening keynote titled, “The Future of Search,” which will examine how search technology is evolving to deliver a richer, more meaningful experience to end users. Exploring search as a unifying application between Web content and services, Prabhakar will go over open tools for developers, such as SearchMonkey, and discuss how these tools can further enhance the search experience for Web consumers.

If you’re planning to attend the Gilbane Conference this week, be sure to stop by Prabhakar’s talk on Wednesday, December 3rd, at 8:30 a.m. at the Westin Copley Place Hotel.

For a complete list of talks, check out the full conference schedule. Hope to see you there.

Yahoo! Search Blog Team

 

Look Before You Click: Search Assist for Image Search

5:00 pm - December 3, 2008 in Yahoo! Search Blog

We just took a look back at the progress we’ve made over the last year with Search Assist and a look forward to our goals of continuing to improve the usefulness of our assistance technology.

With this objective in mind, we recently launched Search Assist for image search to guide users to the answers they seek in a new way. To better enhance the search experience, we took a two-tiered approach to the query formulation and refinement process. In the first stage, we’ve maintained the same paradigm of text suggestions that have helped users of web search complete and refine queries. The second stage is new - from now on, Search Assist can be opened after a query is entered into the search box for users to preview sample thumbnails for the top related search terms.

So if you search for roses in image search, you can preview the types of results you’ll get for red roses, Guns N’ Roses or black roses before clicking on the suggestion. This way, you know what images to expect before drilling down deeper to find your answers.

Roses

Search Assist for images has already been launched on the search results page and we’ll be launching it on Yahoo.com soon. Stay tuned for more improvements to Search Assist for image search in the near future. And please share your feedback in the comments below.

Kaushal Kurapati
Director of Product Management
Yahoo! Search

 

Yahoo! Search Takes on the Windy City

6:00 pm - December 4, 2008 in Yahoo! Search Blog

Chicago will have a lot more than wind to handle next week as Search Engine professionals flurry to the annual Search Engine Strategies Show. Among all the activities at the show, Yahoo! Search will be participating in a variety of panels, covering a range of topics including semantic search, search innovation, duplicate listings, link building, and blended search.

Below is a quick snapshot of where you can find us:

Monday, December 8th
Time: 10:30 - 11:30 a.m.
Session: Universal & Blended Search
Description: This panel will discuss the idea of search multiplicity and recent advancements that will change the world of search and marketing. The talk will also include research data available only at SES.
Yahoo!: Dr. Larry Cornett, VP, Consumer Products, Yahoo! Search

Time: 3:00 - 4:00 p.m.
Session: Is There Life Beyond Google?
Description: This session will delve deep into the search engine market and explore a variety of alternative and specialty search engines that provide innovative features and attributes not readily available on Google.
Yahoo!: Dr. Larry Cornett, VP, Consumer Products, Yahoo! Search

Time: 4:30 - 5:30 p.m.
Session: Semantic Search: How Will it Change Our Lives?
Description: This panel will explore the new generation of semantic technologies and projections for how it will change the future of search. Additionally, the session will cover how consumers will benefit directly from new sets of Internet destinations.
Yahoo!: Dr. Larry Cornett, VP, Consumer Products, Yahoo! Search

Tuesday, December 9th
Time: 10:30 - 11:45 a.m.
Session: Link Building Basics
Description: This session will discuss link analysis as an important component for ranking web pages and go over how Webmasters can increase traffic by building quality links.
Yahoo!: Sharad Verma, Senior Product Manager, Yahoo! Search Technology

Time: 4:15 - 5:30 p.m.
Session: Duplicate Content & Multiple Site Issues
Description: This session looks at the issues of duplicate content and explores solutions for best practices.
Yahoo!: Sharad Verma, Senior Product Manager, Yahoo! Search Technology

Thursday, December 11th
Time: 9:00 - 10:00 a.m.
Session: Think Local: Online Marketing Tactics for Small and Medium Sized Businesses
Description: This session will cover how advertisers can navigate through the sea of local marketing options to most effectively get in front of users.
Yahoo!: Atif Rafiq, Director of Strategy and Business Development for Yahoo! Local

For a complete conference schedule and to learn more about Yahoo!’s whereabouts, visit the SES agenda page.

Hope to see you there. Be sure to stop by the panels and say hi!

Yahoo! Search Blog team

 

Yahoo! Search Blog Offline for Site Maintenance

10:00 pm - December 4, 2008 in Yahoo! Search Blog

We wanted to give our loyal readers a heads up that tomorrow we’ll be making some backend updates to the Yahoo! Search Blog. If you come by for a visit and notice that the site is down, don’t be alarmed - we’re not going anywhere. We’ll be taking the blog offline early tomorrow morning and should be back up within an hour or two.

Of course, if you have any questions, let us know in the comments below.

Yahoo! Search Blog team

 

2008 Technology Awards for MashMaker and SearchMonkey

7:57 pm - December 5, 2008 in Yahoo! Search Blog

RWW Award

The Yahoo! SearchMonkey team would like to congratulate our colleagues at Intel MashMaker, who have just received Argentina’s Sodosky Award for Technology Innovation. As we’ve reported before, MashMaker is not only a powerful tool that enables non-programmers to extract semantic data from websites to create mashups, but you can also use MashMaker to rapidly create extractors for SearchMonkey applications. If you haven’t tried using MashMaker and SearchMonkey together yet, you can download MashMaker, click on the little monkey icon and give it a try.

In related award news, we are pleased to announce that ReadWriteWeb has recognized SearchMonkey as one of its Top 10 Semantic Web Products of 2008 for “the best use of Semantic Web by an Internet bigco this year.” We’d like to thank ReadWriteWeb for the recognition, and congratulate all of our co-winners: Powerset, Calais, Dapper, Hakia (powered by Yahoo! Search BOSS), TripIt, BooRah, AdaptiveBlue, Zemanta and UpTake. We know we’re in good company here and we look forward to making greater progress in 2009.

Evan Goer
Yahoo! SearchMonkey Team

 

BOSS Reaches a Milestone

5:00 pm - December 8, 2008 in Yahoo! Search Blog

We launched Yahoo! Search BOSS this past July and we just reached a significant milestone - the BOSS API is currently serving more than 10 million queries per day. 10 million in and of itself isn’t particularly significant, but we’re sharing it because we believe growing to more than 100 queries a second in just over 5 months says something about the demand for an open search platform. As a point of reference, the total queries from these developer-built, BOSS-powered search engines would rank ahead of the combined searches done on both Facebook and Amazon, and just behind Ask.com1. Note that because these queries are delivered via the BOSS API and served up by our partners, they aren’t counted as Yahoo! Search queries by comScore or other metrics providers.

BOSSChart

Since launch, we’ve been focused on adding features and building up the ecosystem. We’ll maintain that emphasis in 2009, as well as adding monetizing capabilities to the platform. Our plans have been driven by what we’ve heard from developers on our Yahoo! Group and at six hack days held around the world. We still have a great deal of work ahead, but the feedback we’ve received from the BOSS community has validated our hypothesis that there really was a need for a service like BOSS to help power innovation in search.

If you’re interested in checking out what’s been built using BOSS, here are a few places to look:

Thanks to everyone in the BOSS community for helping to make this happen and for being vocal about what we should be working on next. Keep it up!

Bill Michels
BOSS Team

——————————–
1comScore qSearch October 2008

 

An Interview with Dr. Rudi Studer on Semantic Search Technologies

6:39 pm - December 16, 2008 in Yahoo! Search Blog

Dr. Rudi Studer is no stranger to the world of semantic search. A full professor in Applied Informatics at University of Karlsruhe, Dr. Studer is also director of the Karlsruhe Service Research Institute, an interdisciplinary center designed to spur new concepts and technologies for a services-based economy. His areas of research include ontology management, semantic web services, and knowledge management. He has been a past president of the Semantic Web Science Association and has served as Editor-in-Chief of the journal Web Semantics.

In addition to his duties as director of the KSRI, Dr. Studer is a vice president for Semantic Technologies Institute International and helped found ontoprise GmbH, an enterprise software company built around deploying semantic technologies. Dr. Studer recently gave a talk at Yahoo! about semantic technologies, and he was kind enough to answer a set of follow-up questions about the future of semantic search.

Yahoo! (Y!): Could you please tell us about your research on semantic search at the University of Karlsruhe?

Rudi Studer (RS): We look at semantic search as a process of information access, where one or several activities can be supported by semantic technologies. These activities include preprocessing and extraction of information, the interpretation of user information needs, the actual query processing, the presentation of results, and finally, the processing of user feedback for subsequent queries and to generate improved refinements. In all of these steps, semantic technologies can be exploited. For example, with respect to interpreting user information needs, we work on techniques to automatically translate information needs, expressed in either natural language queries or keyword-based queries, into expressive queries that are specified in structured query languages, such as SPARQL.

Y!: Early on, semantic technologies drew criticism for overestimating their own short-term impact and failing to embrace some of the realities of the Web. In what ways do you think the semantic web community has matured since then?

RS: It’s true that in the Semantic Web community a lot of emphasis has been put on Semantics rather than on Web aspects. But, important to note, semantic technologies are not only about the Web. Many of these technologies, e.g. in the context of Enterprise Information Integration, were indeed successful in closed and controlled environments. Now, we’re beginning to see that these technologies are more and more applied to open Web environments, as well.

Of course there have also been many developments that focus on Web aspects in particular. In the context of combining Web 2.0 and Semantic Web technologies, we see that the Web is the central point. In terms of short term impact, Web 2.0 has clearly passed the Semantic Web, but in the long run there is a lot that Semantic Web technologies can contribute. We see especially promising advancements in developing and deploying lightweight semantic approaches.

Y!: In principle, semantic technologies should be able to help search engines more precisely match the user’s intent with the content on the page. But again, this has proven to be harder to realize than originally expected. Are we getting closer to the solution?

RS: No one ever said that it was going to be easy! But yes, we are getting closer. As I indicated before, many of the technologies today work well in closed environments (e.g. Enterprise scenarios), but do not necessarily scale to the Web (yet). But of course there is improvement on that side as well. Powerset (acquired by Microsoft this year), for example, is a good indicator of where we’re headed and certainly a proof point that we’re getting closer.

Y!: The semantic web suffers from a chicken-and-egg problem, where developers are unwilling to create applications due to a lack of metadata, and publishers are unwilling to expose metadata due to a lack of applications. What are some of the ways to break out of this deadlock?

RS: There are two solutions to this: First, we need to make it easier for publishers to produce semantic metadata and second, we need to make the benefits more obvious for the application developers.

With regard to the first aspect, a lot of the data is already available in structured form (e.g. in databases of the deep web), and technically straight-forward to expose in the form of RDF. The Open Linked Data Initiative is a good example of large numbers of data sources that have been published as RDF data. Then there is the unstructured data. Technologies like semantic wikis (e.g. the Semantic MediaWiki) allow the easy and seamless construction of semantic metadata as the content is produced.

The benefits of semantic metadata are becoming more and more obvious. At this year’s ISWC the Billion Triple Challenge uncovered a number of useful applications that show the benefits of combining existing Semantic Web data sources in an intelligent way.

Y!: How do you think major search engines supporting semantic technologies might contribute to the growth of the semantic web?

RS: Once search engines index Semantic Web data, the benefits will be even more obvious and immediate to the end user. Yahoo!’s SearchMonkey is a good example of this. In turn, if there is a benefit for the end user, content providers will make their data available using Semantic Web standards.

Y!: What do you think are some of the commercial opportunities left to be explored by semantic technologies?

RS: So far, semantic technologies have been used in commercial products for data integration, enterprise semantic search and content management, etc. I expect this area to grow, but prospectively I see more and more potential for business opportunities in the combination of the social web and semantic technologies as well as in the context of mashups. An area that is also still largely unexplored is the area of advertisements in the context of semantic search.

Y!: What are some of the pitfalls that developers run into when they first start investigating or deploying semantic metadata?

RS: One problem in the early days was that the tool support was not as mature as for other technologies. This has changed over the years as we now have stable tooling infrastructure available. This also becomes apparent when looking at the at this year’s Semantic Web Challenge.

Another aspect is the complexity of some of the technologies. For example, understanding the foundation of languages such as OWL (being based on Description Logics) is not trivial. At the same time, doing useful stuff does not require being an expert in Logics – many things can already be done exploiting only a small subset of all the language features.

Y!: If you’re a front-end developer who’s interested in finding out more about semantic metadata, where should you get started?

RS: There are now numerous books out there, e.g. Antoniou/van Harmelen: A Semantic Web Primer, Davies et al. (eds.): Semantic Web Technologies, and Staab/Studer (eds.): Handbook on Ontologies. There is also a large collection of video lectures at videolectures.net.

Of course the W3C recommendations for RDF, OWL and SPARQL are a useful reference. For inspiration, I recommend looking at some of the sites exploiting semantic technologies, e.g. semanticweb.org, Twine, or Freebase.

 

TechCrunch Crunchie Awards: Vote for BOSS!

10:14 pm - December 30, 2008 in Yahoo! Search Blog

TechCrunch recently announced that voting is open for the Crunchies. Yahoo! Search BOSS was selected as one of the six finalists for the Best Technology Innovation/Achievement category — so we’re asking for your support.

Click the badge below to vote for BOSS!

Thanks!
The BOSS Team

 
 
 
 
 
 
It's All About Search | © clsc.net |
2010.09.1009:42
Tech used here: Valid HTML - Valid CSS - Valid RSS - JavaScript - PHP - Smarty - MySQL - and a partridge in a pear tree.