\definecolor{ReallyEmph}{rgb}{0.7,0,0}\r
\r
\renewcommand{\titlesize}{\Huge}\r
-\title{Diverse Queries and Feature Type Selection \\ for Plagiarism Discovery}\r
+\title{Diverse Queries and Feature Type Selection for Plagiarism Discovery}\r
\r
% Note: only give author names, not institute\r
\author{Šimon Suchomel, Jan Kasprzak, and Michal Brandejs}\r
\renewcommand{\SubSection}[2][?]{\r
\vspace{0.5\secskip}\r
\refstepcounter{subsection}\r
- {\bf \subsectionsize \textcolor{SectionCol}{\arabic{section}.\arabic{subsection}~#2}}\r
+ {\bf \subsectionsize \textcolor{SectionCol}{\arabic{section}.\arabic{subsection}~\hbox{#2\hskip-15cm\hbox{}}}}\r
\par\vspace{0.375\secskip}\r
}\r
\r
Having this measure, a threshold for download decision needs to be set in order to maximize all discovered similarities\r
and minimize total downloads.\r
A profitable threshold is such that matches with the largest distance between those two curves.\r
+\r
\begin{figure}\r
\centering\r
\includegraphics[width=0.8\textwidth]{img/snippets_graph.pdf}\r