- András Benczúr, Ph.D., head of laboratory
- Address: 1111 Budapest, Lágymányosi u. 11.
Room number: L 412
Phone: +36 1 279 6172
Fax: +36 1 209 5269
E-mail: benczur@sztaki.mta.hu
Homepage: http://datamining.sztaki.hu/
Department: Informatics Laboratory
András Benczúr is the head of the Informatics Laboratory, a group of 20 doctoral students, post-docs and developers that also hosts a Theory of Computing group and a Natural Language Technology group. András received his Ph.D. from the Massachusetts Institute of Technology in 1997; since then his interests have turned to Information Retrieval and Web Search. He has represented SZTAKI as principal investigator in several EU and national R&D projects. His research on spam filtering and low-space approximations for very large scale Web analysis was recognized with a Yahoo Faculty Research Grant in 2006. András is the SZTAKI site coordinator in the Hungarian Future Internet Platform and in the Hungarian node of the FET Flagship FuturICT.
András's laboratory received a <a href="http://mta.hu/articles/momentum-program-of-the-hungarian-academy-of-sciences-130009">major grant from the President of the Hungarian Academy of Sciences</a> for its proposal on "Big Data" research in 2012.
Publications
2012.
- Flexible and Efficient Distributed Resolution of Large Entities
- Authors: Molnár, András J.; Sidló, Csaba István; Benczúr, András A.
Date: 2012.
Published by: FoIKS 2012, LNCS 7153 (Page: 2)
- Content-based trust and bias classification via biclustering
- Authors: Siklósi, Dávid; Daróczy, Bálint Zoltán; Benczúr, András A.
Date: 2012.
Published by: WebQuality '12. Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality (Page: 4)
- Big Web Analytics: Toward a Virtual Web Observatory
- Authors: Spaniol, Marc; Benczúr, András; Viharos, Zsolt János; Weikum, Gerhard
Date: 2012.
Published by: ERCIM News (Page: 2)
2011.
- Temporal analysis for web spam detection: an overview
- Authors: Erdélyi, Miklós; Benczúr, András
Date: 2011.
Published by: TWAW 2011. Proceedings of the 1st international temporal web analytics workshop. Hyderabad, 2011. (Page: 1)
- Web spam classification: a few features worth more
- Authors: Erdélyi, Miklós; Garzó, András; Benczúr, András
Date: 2011.
Published by: WebQuality 2011. Joint WICOW/AIRWeb workshop on web quality. Hyderabad, 2011. (Page: 2)
- SZTAKI @ ImageCLEF 2011
- Authors: Daróczy, Bálint Zoltán; Pethes, Róbert; Benczúr, András
Date: 2011.
Published by: CLEF 2011. Conference on multilingual and multimodal information access evaluation. Amsterdam, 2011.
- Longitudinal analytics on web archive data: it's about time!
- Authors: Weikum, G.; Ntarmos, N.; Spaniol, M.; Triantafillou, P.; Benczúr, András; Kirkpatrick, S.; Rigaux, P.; Williamson, M.
Date: 2011.
Published by: CIDR 2011. 5th biennial conference on innovative data systems research. Asilomar, 2011. (Page: 1)
- Infrastructures and bounds for distributed entity resolution
- Authors: Sidló, Csaba István; Garzó, András; Molnár, András; Benczúr, András
Date: 2011.
Published by: QDB 2011. 9th international workshop on quality in databases. Seattle, 2011. (Page: 1)
- Longitudinal Analytics on Web Archive Data: It's About Time!
- Authors: Weikum, Gerhard; Ntarmos, Nikos; Spaniol, Marc; Triantafillou, Peter; Benczúr, András; Kirkpatrick, Scott; Rigaux, Philippe; Williamson, Mark
Date: 2011.
Published by: 5th Biennial Conference on Innovative Data Systems Research
2010.
- SZTAKI @ TRECVID 2010
- Authors: Daróczy, Bálint Zoltán; Falavigna, Daniele; Gretter, Roberto; Nemeskey, Dávid Márk; Petrás, István; Pethes, Róbert; Benczúr, András
Date: 2010.
Published by: TRECVID 2010 Working Notes.
- An efficient block model for clustering sparse graphs
- Authors: Gyenge, Ádám Balázs; Sinkkonen, Janne; Benczúr, András
Date: 2010.
Published by: MLG 2010. Proceedings of the 8th workshop on mining and learning with graphs, in conjunction with SIGKDD 2010. Washington, 2010. (Page: 6)
- SZTAKI @ TREC 2010
- Authors: Garzó, András; Nemeskey, Dávid Márk; Pethes, Róbert; Siklósi, Dávid; Benczúr, András
Date: 2010.
Published by: TREC 2010 Working Notes
- SZTAKI @ ImageCLEF 2010
- Authors: Daróczy, Bálint Zoltán; Petrás, István; Benczúr, András; Nemeskey, Dávid Márk; Pethes, Róbert
Date: 2010.
Published by: CLEF 2010. Conference on multilingual and multimodal information access evaluation. Notebook Papers of CLEF 2010 LABs and workshops. Padua, 2010. (Page: 1)
- Interest point and segmentation-based photo annotation
- Authors: Daróczy, Bálint Zoltán; Petrás, István; Benczúr, András; Fekete, Zsolt; Nemeskey, Dávid Márk; Siklósi, Dávid; Weiner, Zsuzsa
Date: 2010.
Published by: CLEF 2009 workshop. Multilingual information access evaluation II. Multimedia experiments. Corfu, 2009. (Lecture notes in computer science 6242.) (Page: 3)
- Geographically organized small communities and the hardness of clustering social networks
- Authors: Kurucz, Miklós; Benczúr, András
Date: 2010.
Published by: Data mining for social network data, (Annals of information systems 12.) (Page: 1)
2009.
- Telephone call network data mining: a survey with experiments
- Authors: Kurucz, Miklós; Lukács, László; Siklósi, Dávid; Benczúr, András; Csalogány, Károly; Lukács, András
Editor: Bollobás, B.; Kozma, R.; Miklós, D.
Date: 2009.
Published by: Handbook of large-scale random networks. (Bolyai Society mathematical studies 18.) (Page: 1)
- Web spam challenge proposal for filtering in archives
- Authors: Benczúr, András; Erdélyi, Miklós Bálint; Masanes, Julien; Siklósi, Dávid
Date: 2009.
Published by: Airweb 2009. Proceedings of the 5th international workshop on adversarial information retrieval on the web. Madrid, 2009. (Page: 6)
- Web spam filtering in internet archives
- Authors: Erdélyi, Miklós Bálint; Benczúr, András; Masanes, Julien; Siklósi, Dávid
Date: 2009.
Published by: Airweb 2009. Proceedings of the 5th international workshop on adversarial information retrieval on the web. Madrid, 2009. (Page: 1)
- SZTAKI@ImageCLEF 2008: visual feature analysis in segmented images
- Authors: Daróczy, Bálint Zoltán; Fekete, Zsolt; Brendel, Mátyás; Rácz, Simon; Benczúr, András; Siklósi, Dávid; Pereszlényi, Attila
Date: 2009.
Published by: CLEF 2008. Evaluating systems for multilingual and multimodal information access. 9th workshop on the cross-language evaluation forum. Aarhus, 2008. (Lecture notes in computer science 5706.) (Page: 6)
- SZTAKI @ TRECVID 2009
- Authors: Daróczy, Bálint Zoltán; Nemeskey, Dávid Márk; Petrás, István; Benczúr, András; Kiss, Tamás
Date: 2009.
Published by: TRECVID 2009. TREC video retrieval evaluation. Working Notes.
- Linked latent Dirichlet allocation in web spam filtering
- Authors: Bíró, István; Siklósi, Dávid; Szabó, Jácint; Benczúr, András
Date: 2009.
Published by: Airweb 2009. Proceedings of the 5th international workshop on adversarial information retrieval on the web. Madrid, 2009. (Page: 3)
- SZTAKI @ ImageCLEF 2009
- Authors: Daróczy, Bálint Zoltán; Petrás, István; Benczúr, András; Fekete, Zsolt; Nemeskey, Dávid Márk; Siklósi, Dávid; Weiner, Zsuzsa
Date: 2009.
Published by: 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009
- Kapcsolatok és távolságok: a hazai vezetékes hívás-szokások elemzése
- Authors: Kurucz, Miklós; Siklósi, Dávid; Csalogány, Károly; Lukács, László; Benczúr, András; Lukács, András
Date: 2009.
Published by: Magyar Tudomány (Page: 6)
2008.
- Primal-dual approach for directed vertex connectivity augmentation and generalizations
- Authors: Végh, László; Benczúr, András
Date: 2008.
Published by: ACM Transactions on Algorithms (Page: 2)
- Web Spam Hunting @ Budapest
- Authors: Siklósi, Dávid; Benczúr, András; Fekete, Zsolt; Kurucz, Miklós; Bíró, István; Pereszlényi, Attila; Rácz, Simon; Szabó, Adrienn; Szabó, Jácint
Date: 2008.
Published by: Proc. Airweb 2008 in conjunction with WWW 2008
- Web spam: a survey with vision for the archivist
- Authors: Benczúr, András; Siklósi, Dávid; Szabó, Jácint; Bíró, István; Fekete, Zsolt; Kurucz, Miklós; Pereszlényi, Attila; Rácz, Simon; Szabó, Adrienn
Date: 2008.
Published by: IWAW 2008. 8th international web archiving workshop. Aarhus, 2008. (Page: 1)
- A comparative analysis of latent variable models for web page classification
- Authors: Bíró, István; Benczúr, András; Szabó, Jácint; Maguitman, Ana
Date: 2008.
Published by: LA-Web 2008. IEEE Latin American web conference 2008. Espírito Santo, 2008. (Page: 2)
- Overview of the imageCLEF 2007 object retrieval task
- Authors: Deselaers, Thomas; Hanbury, Allan; Viitaniemi, Ville; Benczúr, András; Brendel, Mátyás; Daróczy, Bálint Zoltán; Balderas, Hugo Jair Escalante; Gevers, Theo; Gracidas, Carlos Arturo Hernández; Hoi, Steven C. H.; Laaksonen, Jorma; Li, Mingjing; Castro, Heidy Marisol Marin; Ney, Hermann; Rui, Xiaoguang; Sebe, Nicu; Stöttinger, Julian; Wu, Lei
Date: 2008.
Published by: CLEF 2007. Advances in multilingual and multimodal information retrieval. 8th workshop of the cross-language evaluation forum. Budapest, 2007. (Lecture notes in computer science 5152.) (Page: 4)
- Multimodal retrieval by text--segment biclustering
- Authors: Benczúr, András; Bíró, István; Brendel, Mátyás; Csalogány, Károly; Daróczy, Bálint Zoltán; Siklósi, Dávid
Date: 2008.
Published by: CLEF 2007. Advances in multilingual and multimodal information retrieval. 8th workshop of the cross-language evaluation forum. Budapest, 2007. (Lecture notes in computer science 5152.)
- Deformable polygon representation and near-mincuts
- Authors: Benczúr, András; Goemans, Michel X.
Editor: Grötschel, M.; Katona, G. O. H.
Date: 2008.
Published by: Building bridges. Between mathematics and computer science. In honour of Laci Lovász. Budapest, 2008. (Bolyai Society mathematical studies 19.) (Page: 1)
- Increasing cluster recall of cross-modal image retrieval
- Authors: Rácz, Simon; Daróczy, Bálint Zoltán; Siklósi, Dávid; Pereszlényi, Attila; Brendel, Mátyás; Benczúr, András
Date: 2008.
Published by: CLEF 2008. Cross language evaluation forum. Aarhus, 2008. (Page: 1)
- Large-scale principal component analysis on LiveJournal friends network
- Authors: Kurucz, Miklós; Benczúr, András; Pereszlényi, Attila
Date: 2008.
Published by: KDD 2008. Proceedings of the 2nd KDD workshop on social network mining and analysis, held in conjunction with SIGKDD'08. Las Vegas, 2008. (Page: 1)
- Latent Dirichlet allocation in web spam filtering
- Authors: Bíró, István; Benczúr, András; Szabó, Jácint
Editor: Castillo, C.; Chellapilla, K.; Fetterly, D.
Date: 2008.
Published by: Airweb 2008. Proceedings of the 4th international workshop on adversarial information retrieval on the web. Beijing, 2008. (Page: 2)
- Cross-language retrieval with wikipedia
- Authors: Schönhofen, Péter; Benczúr, András; Bíró, István; Csalogány, Károly
Date: 2008.
Published by: CLEF 2007. Advances in multilingual and multimodal information retrieval. 8th workshop of the cross-language evaluation forum. Budapest, 2007. (Lecture notes in computer science 5152.) (Page: 7)
2007.
- Semi-supervised learning: a comparative study for web spam and telephone user churn
- Authors: Benczúr, András; Csalogány, Károly; Lukács, László; Siklósi, Dávid
Date: 2007.
Published by: ECML/PKDD 2007. 18th European conference on machine learning / 11th European conference on principles and practice of knowledge discovery in databases. Warsaw, 2007. (Page: 8)
- Spectral clustering in telephone call graphs
- Authors: Kurucz, Miklós; Benczúr, András; Csalogány, Károly; Lukács, László
Date: 2007.
Published by: WebKDD/SNAKDD 2007. Joint 9th WEBKDD and 1st SNA-KDD workshop '07. San José, 2007.
- Web spam detection via commercial intent analysis
- Authors: Benczúr, András; Bíró, István; Csalogány, Károly; Sarlós, Tamás
Date: 2007.
Published by: Airweb 2007. Banff, 2007.
- Who rated what: a combination of SVD, correlation and frequent sequence mining
- Authors: Kurucz, Miklós; Benczúr, András; Kiss, Tamás; Nagy, István II; Szabó, Adrienn; Torma, Balázs
Date: 2007.
Published by: KDDCup 2007. San José, 2007.
- Performing cross-language retrieval with wikipedia
- Authors: Schönhofen, Péter; Bíró, István; Benczúr, András; Csalogány, Károly
Editor: Nardi, A.; Peters, C.; Quochi, V.
Date: 2007.
Published by: CLEF 2007 workshop. Cross language system evaluation campaign. Budapest, 2007. (Page: 1)
- Overview of the imageCLEF 2007 object retrieval task
- Authors: Deselaers, Thomas; Hanbury, Allan; Viitaniemi, Ville; Benczúr, András; Brendel, Mátyás; Daróczy, Bálint Zoltán; Balderas, Hugo Jair Escalante; Gevers, Theo; Gracidas, Carlos Arturo Hernández; Hoi, Steven C. H.; Laaksonen, Jorma; Li, Mingjing; Castro, Heidy Marisol Marin; Ney, Hermann; Rui, Xiaoguang; Sebe, Nicu; Stöttinger, Julian; Wu, Lei
Editor: Nardi, A.; Peters, C.; Quochi, V.
Date: 2007.
Published by: CLEF 2007 workshop. Cross language system evaluation campaign. Budapest, 2007. (Page: 2)
- KDD Cup 2007 task 1 winner report
- Authors: Benczúr, András; Kurucz, Miklós; Kiss, Tamás; Nagy, István II; Szabó, Adrienn; Torma, Balázs
Date: 2007.
Published by: KDD Cup 2007
- KDD cup 2007 task 1 winner report
- Authors: Kurucz, Miklós; Benczúr, András; Kiss, Tamás; Nagy, István II; Szabó, Adrienn; Torma, Balázs
Date: 2007.
Published by: SIGKDD Explorations (Page: 5)
- Methods for large scale SVD with missing values
- Authors: Kurucz, Miklós; Benczúr, András; Csalogány, Károly
Date: 2007.
Published by: KDDCup 2007. San José, 2007.
- Cross-modal retrieval by text and image feature biclustering
- Authors: Benczúr, András; Bíró, István; Brendel, Mátyás; Csalogány, Károly; Daróczy, Bálint Zoltán; Siklósi, Dávid
Editor: Nardi, A.; Peters, C.; Quochi, V.
Date: 2007. 09.
Published by: CLEF 2007 workshop. Cross language system evaluation campaign. Budapest, 2007. (Page: 8)
2006.
- To randomize or not to randomize: space optimal summaries for hyperlink analysis
- Authors: Sarlós, Tamás; Benczúr, András; Csalogány, Károly; Fogaras, Dániel; Rácz, Balázs
Date: 2006.
Published by: WWW 2006. 15th international World Wide Web conference. Edinburgh, 2006. (Page: 2)
- Two-phase data warehouse optimized for data mining
- Authors: Rácz, Balázs; Sidló, Csaba István; Lukács, András; Benczúr, András
Editor: Bussler, C; Castellanos, M; Navathe, S
Date: 2006.
Published by: VLDB 2006. First international workshop on business intelligence for the real time enterprise (BIRTE). Seoul, 2006. (Page: 6)
- PageRank és azon túl: Hiperhivatkozások szerepe a keresésben
- Authors: Benczúr, András; Bíró, István; Csalogány, Károly; Rácz, Balázs; Sarlós, Tamás
Date: 2006.
Published by: Magyar Tudomány (Page: 1)
- Link-based similarity search to fight web spam
- Authors: Benczúr, András; Csalogány, Károly; Sarlós, Tamás
Editor: Davison, BD; Najork, M; Converse, T
Date: 2006.
Published by: Airweb 2006. Proceedings of the 2nd international workshop on adversarial information retrieval on the web. Seattle, 2006. (Page: 9)
- Exploiting extremely rare features in text categorization
- Authors: Schönhofen, Péter; Benczúr, András
Date: 2006.
Published by: Lecture Notes in Artificial Intelligence (Page: 7)
- Detecting nepotistic links by language model disagreement
- Authors: Benczúr, András; Bíró, István; Csalogány, Károly; Uher, Máté
Date: 2006.
Published by: WWW 2006. 15th international World Wide Web conference. Edinburgh, 2006. (Page: 9)
2005.
- SpamRank - fully automatic link spam detection. Work in progress
- Authors: Benczúr, András; Csalogány, Károly; Sarlós, Tamás; Uher, M.
Date: 2005.
Published by: AIRWeb'05. First international workshop on adversarial information retrieval on the web. Chiba, 2005. (Page: 1)
- Primal-dual approach for directed vertex connectivity augmentation and generalizations
- Authors: Végh, LA; Benczúr, András
Date: 2005.
Published by: SODA 2005. Proceedings of the sixteenth annual ACM-SIAM symposium on discrete algorithms. Vancouver, 2005. (Page: 1)
- On the feasibility of low-rank approximation for personalized pagerank
- Authors: Benczúr, András; Csalogány, Károly; Sarlós, Tamás
Date: 2005.
Published by: WWW2005. 14th international World Wide Web conference. Chiba, 2005. (Page: 9)
- Feature selection based on word-sentence relation
- Authors: Schönhofen, Péter; Benczúr, András
Date: 2005.
Published by: ICMLA'05. 4th international conference on machine learning and applications. Proceedings. Los Angeles, 2005. (Page: 3)
2004.
- Magyar nyelvű tartalom a világhálón
- Authors: Benczúr, András; Csalogány, Károly; Fogaras, Dániel; Friedman, E.; Rácz, Balázs; Sarlós, Tamás; Uher, M.; Windhager, E.
Editor: Szeli, K
Date: 2004.
Published by: Információs társadalom internet információtechnika. (Kutatási jelentés 26) (Page: 4)
2003.
- Searching a small national domain - preliminary report
- Authors: Benczúr, András; Csalogány, Károly; Friedman, E.; Fogaras, Dániel; Sarlós, Tamás; Uher, M.
Date: 2003.
Published by: Proceedings of the twelfth international conference on World Wide Web, WWW2003. Budapest
- Pushdown-reduce: an algorithm for connectivity augmentation and poset covering problems
- Authors: Benczúr, András
Date: 2003.
Published by: DISCRETE APPLIED MATHEMATICS (Issue no.: 2, Page: 2)
- Formal description of a distributed location service for mobile ad hoc networks
- Authors: Benczúr, András; Glasser, U.; Lukovszki, T.
Date: 2003.
Published by: LECTURE NOTES IN COMPUTER SCIENCE (Page: 2)
2002.
- Algebra and computation at SZTAKI
- Authors: Benczúr, András; Ivanyos, Gábor; Rónyai, Lajos
Date: 2002.
Published by: ERCIM NEWS (Page: 3)
2000.
- Fast algorithms for even/odd minimum cuts and generalizations
- Authors: Benczúr, András; Fülöp, O.
Editor: Paterson, M
Date: 2000.
Published by: Algorithms - ESA 2000. (Lecture notes in computer science 1879) (Page: 8)
1999.
- Dilworth's theorem and its application for path systems of a cycle-implementation and analysis
- Authors: Benczúr, András; Förster, J.; Király, Z.
Editor: Nesetril, J
Date: 1999.
Published by: Algorithms - ESA '99. 7th annual European symposium. Prague, 1999. (Lecture notes in computer science, 1643.) (Page: 4)
1998.
- Augmenting undirected edge-connectivity in Õ(n²) time
- Authors: Benczúr, András; Karger, DR
Date: 1998.
Published by: Proceedings of the ninth annual ACM-SIAM symposium on discrete algorithms. San Francisco, 1998. (Page: 5)