X-Git-Url: https://www.fi.muni.cz/~kas/git//home/kas/public_html/git/?a=blobdiff_plain;f=pan13-paper%2Fpan13-notebook.tex;h=3febdcc6c2cee3222d7038b9e19f165b4878bb69;hb=479a615009e1e6eaa1efd9714ce65511d40e2e6e;hp=a6d2ba3f9c7dc1bbd7590de86fcd611a417f2864;hpb=a864666d2318f3ba5af8f0d53d36981254386043;p=pan13-paper.git diff --git a/pan13-paper/pan13-notebook.tex b/pan13-paper/pan13-notebook.tex index a6d2ba3..3febdcc 100755 --- a/pan13-paper/pan13-notebook.tex +++ b/pan13-paper/pan13-notebook.tex @@ -19,22 +19,33 @@ \begin{abstract} This paper describes approaches used for the Plagiarism Detection task in PAN 2013 international competition -on uncovering plagiarism, authorship, and social software misuse. - +on uncovering plagiarism, authorship, and social software misuse. +We present modified three-way search methodology for Source Retrieval subtask and analyse snippet similarity performance. +Next, we show changes in selected feature for text alignement which led to plagdet score improvement. +The results of source retrieval show, that presented approach is adaptable in real-world plagiarism situations. +Improved results for text alignment achieved in the competition overall third place. \end{abstract} \section{Introduction} - -The notebooks shall contain a full write-up of your approach, including all details necessary to reproduce your results. +In PAN 2013 competition on plagiarism detection we participated in both the Source Retrieval +and the Text Alignment subtask. In both tasks we adapted methodology used in PAN 2012. +Section~\ref{source_retr} describes querying approach for source retrieval, where we used three different +types of queries. We present a new type of query based on text paragraphs. +The query execution were controled by its type and by preliminary similarities +discovered during the searches. +In section~\ref{text_alignment} we present modified common text feature fot text alignment. +We also compare performance of both the previous and the modified algorithms. -\include{simon-source_retrieval} -\include{yenya-text_alignment} +\input{simon-source_retrieval} +\input{yenya-text_alignment} \section{Conclusions} +Unfortunately the ChatNoir search engine does not support phrasal search, therefore it +is possible that evaluated results may be quite distorted in this manner. \bibliographystyle{splncs03} \begin{raggedright}