Browse Source

Initial Overleaf Import

master
Jeffery Russell 4 years ago
commit
8049e171b1
2 changed files with 56 additions and 0 deletions
  1. +36
    -0
      project-proposal.tex
  2. +20
    -0
      ref.bib

+ 36
- 0
project-proposal.tex View File

@ -0,0 +1,36 @@
% \documentclass[12pt]{extarticle}
\documentclass[12pt]{apa6}
\usepackage[utf8]{inputenc}
\usepackage{cite}
\usepackage{hyperref}
\title{Review Proposal: K-means Clustering Over Time}
%
\author{Jeffery B. Russell}
\affiliation{%
Fourth Year Computer Science Student at RIT\\
CUBRC Research Assistant\\
RITlug President
}%
\begin{document}
\maketitle
With the ubiquity of data in today's age, machine learning has become a driving force in research and innovation in both the public and private sectors. Due to the vast quantity of data generated by the internet, unsupervised learning has been at the forefront of data science over the past few decades. Clustering has been and will continue to be an essential way of extracting data from large sets of information.
This project aims to break down and analyze two papers that use k-means clustering. In 1967 J. MacQueen wrote the historical article that introduced \cite{k-means} the original k-means clustering. MacQueen's paper originally coined the term k-means\cite{k-means} and is going to be reviewed because it provides useful historical insight into where this field of research started and what influenced it. Since the paper was written in 1967, this opens an interesting area of investigation because we can compare a relatively older research paper on clustering with a new one. This paper can be found on the Project Euclid website \footnote{\url{https://projecteuclid.org/download/pdf_1/euclid.bsmsp/1200512992}}.
The second paper that I will review was written by John Paparrizos and Luis Gravano in 2016 and covers how we can cluster time series data using an algorithm they call k-shape, which was inspired by the k-means algorithm \cite{k-shape}. The paper can be found on the University of Colombia's website\footnote{\url{http://web2.cs.columbia.edu/~gravano/Papers/2015/sigmod2015.pdf}}.
The K-shape paper is of special interest to me because I used during my last co-op while researching Space Situational Awareness (SSA).
Daniel Moore is going to be the peer reviewer for this literature review.
Although both papers are over ten pages, they are appropriate for this assignment due to their significance in the field and my prior knowledge of the topic.
\bibliographystyle{plain}
\bibliography{ref}
\end{document}

+ 20
- 0
ref.bib View File

@ -0,0 +1,20 @@
@article{k-shape,
author = {Paparrizos, John and Gravano, Luis},
year = {2016},
month = {06},
pages = {69-76},
title = {k-Shape: Efficient and Accurate Clustering of Time Series},
volume = {45},
journal = {ACM SIGMOD Record},
doi = {10.1145/2949741.2949758}
}
@Misc{k-means,
Author = {J. {MacQueen}},
Title = {{Some methods for classification and analysis of multivariate observations.}},
Year = {1967},
Language = {English},
HowPublished = {{Proc. 5th Berkeley Symp. Math. Stat. Probab., Univ. Calif. 1965/66, 1, 281-297 (1967).}},
MSC2010 = {62H30},
Zbl = {0214.46201}
}

Loading…
Cancel
Save