Newsvine
  • Welcome
  • Help
  • Report Bug
  • Conversation Tracker
  • Your Column
  • Replies
  • Friends
Type Comments Since You Last CheckedArticle Source Last Checked Stop Tracking All Clear Tracking All
advertisement
Log In | Register
Close the Login Panel
Existing users log in below. New users please register for a free account.

New Users:

Existing Users:

E-Mail:
Password:
Forgot Password?
Please enter the e-mail address or domain name you registered with:
E-Mail/Domain:
Back to Login
Log Out
  • Top News
  • Local News
  • World
  • U.S.
  • Sports
  • Politics
  • Tech
  • Entertainment
  • Science
  • Business
  • Health
  • Odd News
  • More
    • Arts
    • Education
    • Environment
    • Fashion
    • History
    • Home & Garden
    • Not News
    • Religion
    • Travel
Visit kilometer's column >>

KILOMETER

former peace corps volunteer in grad school in the middle east
Articles Posted: 5  Links Seeded: 49
Member Since: 1/2006  Last Seen: 5/27/2007

What is Newsvine?

Updated continuously by citizens like you, Newsvine is an instant reflection of what the world is talking about at any given moment.

Get a Free Account
Help
Fun Stuff
  • Your Clippings
  • Leaderboard
  • E-Mail Alerts
  • Top of the Vine
  • Newsvine Live
  • Newsvine Archives
  • The Greenhouse
  • Recommended Articles
  • Wall of Vineness
Put a Seed Newsvine link on your own site

What to do about Redundancy in the Vine?

Wed Feb 8, 2006 4:53 AM EST
technology, newsvine, stories, seed, vine, seeding, redundancy
By kilometer
advertisement

lately, i've been noticing a lot more redundancy in the vine - like three articles in a row seeding to three different generic stories about the mohammed cartoon uproar. these redundancies make newsvine less enjoyable as i find myself having to sift through old news to find new news, which is the point of newsvine in the first place. how do we get rid of these? does the newsvine community need to be more responsible by searching for terms in their article before just blindly seeding? as newsvine grows it could become a bigger problem, especially because new users seem to be very eager to seed.

i was thinking of either adding a vote against box - where articles with negative points get pushed off the front page or a redundancy item in the "!" box. the second option would have to be checked by someone and this would probably be someone's full time job when newsvine comes out of beta.

do you have any other suggestions for how to easily weed out stories that don't actually add new content or discussion to the vine? keep seeding.

haha. just saw the new duplicate option under the "!" box. does this get checked manually? great job guys!

  • Enjoy this article? Help vote it up the 'Vine.

Back To Top | Front Page

Published to:

  • kilometer's Column, All of Newsvine
  • Groups: none
  • Regions: none
  • Public Discussion (35)
Jump to discussion page: 1 2
Alexander Repty

I actually like the way digg.com handles this: when submitting a story, the system automatically searches for possible similar or duplicate stories. On Newsvine, this might be even better as we have the great virtue of tags.

    Reply#1 - Wed Feb 8, 2006 5:24 AM EST
    SamuraiChamploo

    Digg.com also allows users to "bury" stories and report them as lame, off-topic, duplicates or inaccurate. I would like to see stuff like this worked into Newsvine too. Right now there are three stories on the front page about the 1gb iPod nano.

      Reply#2 - Wed Feb 8, 2006 7:02 AM EST
      Christopher Masto

      It would be better if you could identify what it's a duplicate of, and after somehow vetting that (x number of duplicate "votes" or something), "collapse" the stories into one.

      Also, I've only posted one link so far, but there did appear to be some kind of "this appears to be similar" message, although in my case it was for a completely unrelated story.

      Another problem that I see already here, and that made me stop reading digg, is the continual posting of "old stuff". Not necessarily duplicates, but people finding things on the web that they think are interesting and not realizing they're ancient history. This was particularly a problem for digg because most of the users seemed to be children. One example I remember is when the "IBM mouse ball memo" made it to the front page. I saw something similar to that on Newsvine yesterday, but I can't find the example now. It may be a harder problem to solve, especially if it's not really a problem for everyone.. if you haven't seen the dancing baby, it may be news to you.

        Reply#3 - Wed Feb 8, 2006 7:21 AM EST
        smithcoDeleted
        wolfger

        I agree with Smithco. Just because there are three "redundant" stories on the vine does not mean that all three stories say the same exact things. Multiple sources are a good thing.

          Reply#5 - Wed Feb 8, 2006 7:43 AM EST
          Tobsterius

          I agree that multiple sources are a good thing, but there really needs to be a way to weed out the duplicate articles. Especially if they are AP stories, and were just submitted from different news sites.

          I especially like the way Digg handles it. It searches the database for possible dupes before posting.. Of course, not that anyone over at Digg actually cares about posting dupes...

            Reply#6 - Wed Feb 8, 2006 8:19 AM EST
            Matt Hoult

            I think Christopher Masto, smithco and wolfger all make some very good points that are linked well. If Newsvine could find an algorithm (or even a functionality button that would allow users to manually make the change) which allows mulitple stories to be combined into one, each accredited to their respective seeders/authors then the stories are more valid overall. To repeat wolfger; multiple sources are a good thing.

              Reply#7 - Wed Feb 8, 2006 9:23 AM EST
              Matt Hoult

              Additionally (I wish you could edit comments in the first 5 minutes after posting) you have the bonus that once submitted the alternate sources would be red flagged by the existing architecture of Newsvine; thus disallowing any future seeding of that article.

                Reply#8 - Wed Feb 8, 2006 9:25 AM EST
                C.H. Castro

                This is from the Company Info page

                Established news sites like MSNBC and CNN are able to bring you a wealth of information from around the world mostly by editing and syndicating thousands of Associated Press articles every day. Depending on what your favorite news site is, somewhere between 70 and 90% come from the A.P. Newsvine syndicates these exact same articles but takes things a step further.

                Where its says "...Newsvine syndicates these exact same articles..." does that mean Newsvine itself has these stories already loaded or the members of Newsvine syndicate these stories through seeding. Because wouldn't that be causing some redundancy too.

                  Reply#9 - Wed Feb 8, 2006 11:11 AM EST
                  Sean S

                  I'm not sure if you guys noticed this, but recently Newsvine changed it's reporting verbiage to reflect new "Report As ..." labels. One of the new labels is "Duplicate Story".

                  Simply click on the "!" icon that appears below each article/seed title to report the item.

                  Also, concerning Alexander's first comment about automatic notification of an already seeded story: It's already implemented. When you click on the "Seed to Newsvine" button the popup window masthead will be red if the story/article has already been seeded. Additionally a link to the seed will be provided so you may join the discussion about it!

                    Reply#10 - Wed Feb 8, 2006 11:37 AM EST
                    SamuraiChamploo

                    ^^^Sorry for my incredibly dumb post up there (number 2). This is my first day here and i barely checked out the stuff on newsvine before i submitted that comment.
                    I also noticed Newsvine has no edit button eheheheheh....

                      Reply#11 - Wed Feb 8, 2006 12:10 PM EST
                      kilometer

                      multiple sources are great as wolfger, tobsterius, and others commented. but not when they all say the same thing - 3 headlines all saying basically "Apple announces 1gig ipod" from cnn, cnet, and wired are not really helpful to me and i think defeats the point of newsvine. if the article has a different angle or really does add to the discussion in some way then the headline should indicate as much - otherwise i want it buried and not cluttering up my front page. this might also force users to be more engaging when seeding articles and act as an incentive for them to do more than just leave the already existing headline if they want people to see it.

                        Reply#12 - Wed Feb 8, 2006 12:23 PM EST
                        Matt Hoult

                        @kilometer Does this really stand up if the links are added to the first seeded source of the article? Say you submitted an article from CNN which I then found on BBC; would me adding the link directly to the bottom of your post for further reading/discussion/information be a bad thing? It would not be cluttering but would offer more sources for (possibly) further information, confirmation and different takes on the topic.

                        I think this would work, but I am asking if you foresee something which I do not.

                          Reply#13 - Wed Feb 8, 2006 12:33 PM EST
                          theannalog

                          The only problem i see with reporting duplicate seeds is that sometimes both dupes get comments -- there needs to be some system for integrating these comments. other than that, I strongly support user-reporting of dupes.

                            Reply#14 - Wed Feb 8, 2006 12:38 PM EST
                            louiswu

                            It is interesting to see different angles on the same issue, but redundancy is a definite put-off. What'd be good is for articles on the same subject to be "clumped" together, so I see links to all related articles about a given news item. Based on reviews, posts that are too similar can be merged. This would make the process a tad more complex, but it's a not an easy problem to solve anyway.

                              Reply#15 - Wed Feb 8, 2006 12:41 PM EST
                              Mike D.

                              Industrial-strength-redundancy-and-relatedness-engine coming this week.

                              Codename: Doppelganger!

                                Reply#16 - Wed Feb 8, 2006 12:48 PM EST
                                Nathan

                                I think that rather than search by possible keywords, the way Digg does it, just search by root URL. I look forward to whatever the NV team has cooked up.

                                  Reply#17 - Wed Feb 8, 2006 1:11 PM EST
                                  thoughtful

                                  I think that since people want newvine to be "full of info" i.e. few redundancies three solutions might help:-
                                  1. adding a 10 word summary - this will have the key words, so that detecting similar articles will be easy.
                                  [Thsi might be like the META tags to be used by newsvine to help collate similiar articles.]
                                  2. having the system find you the entries with similar keywords.
                                  This way, you will not accidentally start a similar thread.
                                  3. News is topical and "perishable" so tagging the keywords with a date (e.g. 10 days old) will allow you to match common articles. [This is a suggestion to newsvine - I really don't know how to combine the time aspect
                                  - but I suspect it will improve the relevancy.]

                                    Reply#18 - Wed Feb 8, 2006 1:28 PM EST
                                    dave proth

                                    How bout an x that clears the story from you own home page. This way u can track what stories you have read or don't care to read.

                                      Reply#19 - Wed Feb 8, 2006 1:36 PM EST
                                      Atticus Finch

                                      I like the idea of clumping related stories together. Maybe having one main article, with a list of several other articles below. Kind of like how Wikipedia has cites and links to additional materials at the bottom. Also, that way the discussion would all be in one central location.

                                        Reply#20 - Wed Feb 8, 2006 2:17 PM EST
                                        Maria G.

                                        Can't wait for the new "industrial-strength-redundancy-and-relatedness-engine" coming this week! Sounds great.

                                        I'd prefer that exact dupes be deleted; maybe their comments could be ported over to the first instance of that article.

                                        Dupes that have a slightly different angle or are reported by a different source...I'd prefer these to be grouped like Google. I like having the option to click on different sources and get different points of view.

                                          Reply#21 - Wed Feb 8, 2006 2:59 PM EST
                                          FiZ

                                          It's funny that I read this article because I have one that appears underneath it if you filter for the newsvine tag. I saw your article, but the only reason I still typed mine is beacuse I commented about the roles of bloggers/social journalists and internet information efficiency in general and less about site features. Though seeing the two next to eachother is funny to look at.

                                            Reply#22 - Wed Feb 8, 2006 2:59 PM EST
                                            Patrick Yan

                                            like someone who seeded a story in december and then the same story was seeded in february again -_-

                                              Reply#23 - Wed Feb 8, 2006 4:18 PM EST
                                              Tony Scida

                                              Mike D.:
                                              Can you title it: Doppelganger! The industrial strength redundancy and repetitiveness engine.

                                                Reply#24 - Wed Feb 8, 2006 5:38 PM EST
                                                Lee Stone

                                                Ive noticed that a lot of the AP articles seem to be duplicates as well - but there's no way of warning of this - does that mean they have priority over user articles?
                                                Will the new system allow the people who are posting these see if they are posting something which a newsvine user has already managed to post about.

                                                  Reply#25 - Wed Feb 8, 2006 5:58 PM EST
                                                  monkeywork

                                                  dave proth writes:

                                                  How bout an x that clears the story from you own home page. This way u can track what stories you have read or don't care to read.

                                                  Thats a great idea!

                                                    Reply#26 - Wed Feb 8, 2006 7:08 PM EST
                                                    Jump to discussion page: 1 2
                                                    Leave a Comment:
                                                    You're in Easy Mode. If you prefer, you can use XHTML Mode instead.
                                                    You're in XHTML Mode. If you prefer, you can use Easy Mode instead.
                                                    (XHTML tags allowed - a,b,blockquote,br,code,dd,dl,dt,del,em,h2,h3,h4,i,ins,li,ol,p,pre,q,strong,ul)
                                                    Newsvine Privacy Statement
                                                    As a new user, you may notice a few temporary content restrictions. Click here for more info.
                                                    FUN STUFF:
                                                    • Leaderboard |
                                                    • E-Mail Alerts |
                                                    • Top of the Vine |
                                                    • Newsvine Live |
                                                    • Newsvine Archives |
                                                    • The Greenhouse |
                                                    COMPANY STUFF:
                                                    • Code of Honor |
                                                    • Company Info |
                                                    • Contact Us |
                                                    • Jobs |
                                                    • User Agreement |
                                                    • Privacy Policy |
                                                    • About our ads
                                                    LEGAL STUFF:
                                                    • © 2005-2012 Newsvine, Inc. |
                                                    • Newsvine® is a registered trademark of Newsvine, Inc. |
                                                    • Newsvine is a property of msnbc.com