In Summary
Building an effective content summarizer is in many ways the same task as building an effective topical analysis and filtering system for unstructured content.
An effective summarizer is a program that 'reads' a lengthy article and then provides a natural-language summary (or even just bullet points for now; we can handle that) of the article's 'pertinent' points. That means determining the key topics and then extracting and rephrasing those points in a shortened format. So the challenge is not only identifying the topics, but deciding which of them are the "key" topics.
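To make the "determine the key topics, then extract" idea concrete, here is a minimal sketch in Python. It assumes that word frequency (after dropping common stop words) is a reasonable stand-in for topic importance, which is a simplification, and it only extracts existing sentences rather than rephrasing them. The names `STOP_WORDS`, `tokenize`, `key_topics`, and `summarize` are all made up for illustration.

```python
import re
from collections import Counter

# A tiny, illustrative stop-word list (an assumption; real systems use far larger ones).
STOP_WORDS = {
    "a", "an", "and", "are", "as", "at", "be", "but", "by", "for", "in", "is",
    "it", "of", "on", "or", "that", "the", "then", "this", "to", "was", "with",
}

def tokenize(text):
    """Lowercase the text and return its words, minus stop words."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]

def key_topics(text, n=5):
    """Return the n most frequent content words as rough 'key topics'."""
    return [word for word, _ in Counter(tokenize(text)).most_common(n)]

def summarize(text, max_sentences=2):
    """Pick the highest-scoring sentences and return them in their original order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(tokenize(text))

    def score(sentence):
        # Average frequency of the sentence's content words across the whole text.
        words = tokenize(sentence)
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:max_sentences])]
```

Calling `summarize(article_text)` returns the top sentences as a list, which is already the "bullet points for now" form of a summary. Turning those into rephrased natural language is the harder part and is not attempted here.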
Then we get into subjectivity. The author is the ultimate arbiter of what the key topics are, but different readers may get different value from the content, so in practice the decision depends on the needs of the audience. Suppose a reader is looking for information on nanotechnology: is that a key topic in the piece, or just a passing mention?
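One crude way to approach the "key topic or passing mention?" question is to measure how much of the piece actually talks about the reader's term. The sketch below assumes that the share of sentences mentioning the term is a fair proxy for prominence, and the 0.3 cutoff is an arbitrary placeholder rather than a tuned value. The function names are hypothetical.

```python
import re

def split_sentences(text):
    """Split text into sentences on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def topic_prominence(text, term):
    """Return the fraction of sentences that mention the term as a whole word."""
    sentences = split_sentences(text)
    if not sentences:
        return 0.0
    pattern = re.compile(r"\b" + re.escape(term.lower()) + r"\b")
    hits = sum(1 for s in sentences if pattern.search(s.lower()))
    return hits / len(sentences)

def is_key_topic(text, term, threshold=0.3):
    """Treat the term as 'key' if enough sentences mention it (threshold is an assumption)."""
    return topic_prominence(text, term) >= threshold
```

A different reader would pass a different `term` (or a different `threshold`), which is exactly the subjectivity problem: the same article gives different answers depending on who is asking.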
These are the challenges. What's the key topic of this blog entry? Well, I'm not going to make it easy by putting any labels on it.
1 Comment:
effective content summarizer ways task building effective topical analysis filtering system unstructured content
are some topic "keywords"