skip to main content
10.5555/1283383.1283494acmconferencesArticle/Chapter ViewAbstractPublication PagessodaConference Proceedingsconference-collections
Article

k-means++: the advantages of careful seeding

Published:07 January 2007Publication History

ABSTRACT

The k-means method is a widely used clustering technique that seeks to minimize the average squared distance between points in the same cluster. Although it offers no accuracy guarantees, its simplicity and speed are very appealing in practice. By augmenting k-means with a very simple, randomized seeding technique, we obtain an algorithm that is Θ(logk)-competitive with the optimal clustering. Preliminary experiments show that our augmentation improves both the speed and the accuracy of k-means, often quite dramatically.

References

References are not available

  1. k-means++: the advantages of careful seeding

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms
      January 2007
      1322 pages
      ISBN:9780898716245
      • Conference Chair:
      • Harold Gabow

      Publisher

      Society for Industrial and Applied Mathematics

      United States

      Publication History

      • Published: 7 January 2007

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      SODA '07 Paper Acceptance Rate139of382submissions,36%Overall Acceptance Rate411of1,322submissions,31%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader