Downloads
Applications for mobile by SZTAKI
Mouse 3D for Virca (iOS)
- Mouse3D lets you control 3D (and 2D) applications in mid-air using the mobile phone or iPod touch you always have at hand. It is especially handy if you want to control the 3D space from a few steps away from your screen, moving freely around the room without being bound to your keyboard and mouse.
- description and download
GUIDE@HAND (iOS, Android)
Open source software by SZTAKI
SOAP MTP for Jade and Light Agent Web Service Integration Toolkit
- DSD, András Micsik, 2008
- This is an add-on for the Jade agent development framework which enables Jade platforms to communicate via SOAP messages instead of HTTP or IIOP. It uses Apache CXF for sending and receiving SOAP messages. The approach is simple: it sends the ACL envelope and the payload as two message parameters to the other Jade platform. The ACL envelope is not mapped to SOAP headers, so SOAP headers remain free to be used as necessary by the hosting environment. The conversion of outgoing and incoming SOAP messages is easy to modify using CXF interceptors.
- description
- download
Reticular Alignment: algorithm for multiple sequence alignment
- DMS, István Miklós, Adrienn Szabó, 2010
- Reticular Alignment is our new method for multiple sequence alignment. Unlike previous corner-cutting methods, our approach does not define a compact part of the dynamic programming table. Instead, it defines a set of optimal and suboptimal alignments at each step during the progressive alignment. The set of alignments is represented as a network, so that the alignments can be stored and used efficiently during the progressive alignment. The program has a threshold parameter on which the size of the network depends. The larger the threshold parameter, and thus the network, the deeper the search in the alignment space for better-scoring alignments.
- description
- download
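The role of the threshold parameter can be illustrated on a pairwise toy case. The sketch below is not the Reticular Alignment implementation: it assumes a simple linear scoring scheme (match +1, mismatch -1, gap -1, all hypothetical values) and two sequences only. It marks every cell of the dynamic programming table that lies on some alignment scoring within `threshold` of the optimum, which shows how a larger threshold widens the set of alignments under consideration.

```python
def forward_scores(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch table: f[i][j] is the best score of aligning a[:i] with b[:j]."""
    n, m = len(a), len(b)
    f = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        f[i][0] = i * gap
    for j in range(1, m + 1):
        f[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            f[i][j] = max(f[i - 1][j - 1] + sub,   # align a[i-1] with b[j-1]
                          f[i - 1][j] + gap,       # gap in b
                          f[i][j - 1] + gap)       # gap in a
    return f


def near_optimal_cells(a, b, threshold):
    """Return (optimal score, set of cells on alignments within threshold of it)."""
    n, m = len(a), len(b)
    fwd = forward_scores(a, b)
    # Scores of aligning the suffixes a[i:] and b[j:] are obtained by running
    # the same recursion on the reversed sequences.
    rev = forward_scores(a[::-1], b[::-1])
    best = fwd[n][m]
    cells = set()
    for i in range(n + 1):
        for j in range(m + 1):
            # Best score of any alignment passing through cell (i, j).
            through = fwd[i][j] + rev[n - i][m - j]
            if through >= best - threshold:
                cells.add((i, j))
    return best, cells


if __name__ == "__main__":
    for t in (0, 1, 3):
        score, kept = near_optimal_cells("GATTACA", "GCATGCU", t)
        print(f"threshold={t}: optimum={score}, cells kept={len(kept)}")
```

For two identical sequences and `threshold=0` only the main diagonal is kept; raising the threshold adds cells around it.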
gUSE: grid User Support Environment
- LPDS
- gUSE (grid User Support Environment) is a grid virtualization environment providing a scalable set of high-level Grid services by which interoperation between Grids and user communities can be achieved. Incorporating a more flexible workflow concept and enabling its distribution on clusters and different Grid sites, gUSE is aimed to extend the objectives and features of WS-P-GRADE Portal.
- description
- official site
- download
DCI Bridge
- LPDS
- DCI Bridge is implemented as a set of web services that bind together in flexible ways on demand to deliver user services in Grid, cluster, cloud and/or web services environments. DCI Bridge is a web application (service) that provides standard access to distributed computing infrastructures (DCIs) such as grids, desktop grids, clusters, clouds and service-based computational resources by implementing the OGSA Basic Execution Service 1.0 specification. The DCI Bridge web application creates a transparent layer between the users (workflow systems) and the DCI systems. The user can submit jobs to the various DCI systems using the OGSA Basic Execution Service (BES) interface defined here. As a result, users do not have to learn the access protocols of the various DCI systems, since these are hidden behind the BES interface. The DCI Bridge supports the use of the Meta Broker, which can choose the most suitable environment for a job from the available execution resources. The DCI Bridge then executes the job in the selected environment.
- description
- official site
3G Bridge
- LPDS
- The Generic Grid-Grid (3G) Bridge is an open-source core job bridging component between different grid infrastructures. Its development started in 2008 within the CancerGrid and EDGeS projects. The aim was to create a generic bridge component that can be used in different grid interoperability scenarios. The 3G Bridge is used within the EDGeS project, where it provides the core component of the Service Grid - Desktop Grid interoperability solution. 3G Bridge helps to connect user communities of different grid systems. For example, communities working on parameter sweep problems (physicists, biologists, etc.) and using service grid infrastructures can migrate their applications to the more adequate desktop grid platform using the 3G Bridge technology, resulting in accelerated research.
- description
- official site
- download
P-Grade
- LPDS
- The P-GRADE Grid Portal is a web-based, service-rich environment for the development, execution and monitoring of workflows and workflow-based parameter studies on various grid platforms. P-GRADE Portal hides low-level grid access mechanisms behind high-level graphical interfaces, so that even users who are not grid experts can define and execute distributed applications on multi-institutional computing infrastructures. Workflows and workflow-based parameter studies defined in the P-GRADE Portal are portable between grid platforms without learning new systems or re-engineering program code. Technology-neutral interfaces and concepts of the P-GRADE Portal help users cope with the large variety of grid solutions. Moreover, any P-GRADE Portal installation can access multiple grids simultaneously, which enables the easy distribution of complex applications on several platforms.
- description
- official site
- download
SZTAKI Desktop Grid
- LPDS
- The goal of SZTAKI Desktop Grid is to provide an enterprise solution to exploit PCs and clusters located at different sites of a company or institute, solving large scale distributed programs via an easy-to-use application programming interface. It is extended to include clusters as single powerful PCs and to hierarchically propagate work from one desktop grid to the other. SZTAKI Desktop Grid is basically a BOINC server packed in a Debian® package, to make server deployment as easy as possible. The SZTAKI Desktop Grid package is the main software component of a public, worldwide accessible desktop grid system. To effectively serve a massive number of participants, an extensive website component is part of this package.
- description
- official site
- download
GBAC
- LPDS
- The Generic BOINC Application Client (GBAC) is a virtualization (VirtualBox) based wrapper. Beyond what its name suggests, it aims to be a generic framework providing virtualized environments for various distributed computing infrastructures (DCIs). It is based on the VBoxWrapper of BOINC, but GBAC was implemented using the DC-API Meta API and does not rely on any middleware-specific functionality, so it can be used on any DCI. Currently our implementation supports the BOINC, Condor and XtremWeb middleware besides standalone execution.
- description
- download
MetaBroker
- LPDS
- The Generic Meta-Broker Service can be used to select a DCI for a service of a workflow. The broker selection for a user request is based on historical performance metrics of the brokers/submitters and on the latest aggregated background load of the appropriate DCI. The same interface and methodology is used to support multiple, diverse DCIs.
- description
- download
GenWrapper
- LPDS
- A generic BOINC wrapper for legacy applications utilizing GitBox (a variant of BusyBox): use POSIX-like shell scripting and built-in commands such as tar, awk, sed, zip, etc. to control and execute your legacy application.
- description
- download
HunTag
- HLT, Gábor Recski, Dániel Varga
- a sequential tagger for NLP combining the linear classifier Liblinear and Hidden Markov Models
- official site
- download
Webcorpus Creator
- HLT, Attila Zséder, Dániel Varga
- A collection of scripts and programs for creating a webcorpus from crawled data.
- official site
- download
ISF2
- CIMLAB
- This module connects the EMC2 (LinuxCNC) machine control software to the VirCA system and enables Incremental Sheet Forming (ISF) operations to be executed on milling machines and multi-axis robots.
- official site
- download
VirCA – Virtual Collaboration Arena
TP tool
Open datasets by SZTAKI
Wikipedia text dump
- DSD, Máté Pataki, 2012
- When seeking information on the web, Wikipedia is an essential source. The English version features nearly 4 million articles. Studies show that it is also the number one source of plagiarism, so when we created our new translational plagiarism checker, we looked for a way to add this vast source of information to our database. We found that it is impossible to download the whole database in an easy-to-handle format (like HTML or plain text) and that all the available Mediawiki converters had some flaws. So we have written a Mediawiki XML dump to plain text converter, which we run every time a new database dump appears on the site, and we publish the text version for everybody to use.
- description
- download
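To give a feel for what converting a Mediawiki XML dump to plain text involves, here is a minimal sketch. It is not the converter described above: the regular expressions, the function names and the sample page are all assumptions made for illustration, and the markup handling is deliberately crude (nested templates, tables and references are not handled). It streams through the dump with `xml.etree.ElementTree.iterparse` so that the whole file never has to be held in memory.

```python
import io
import re
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the XML namespace prefix, e.g. '{http://...}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def strip_markup(wikitext):
    """Very rough wikitext cleanup (illustrative only)."""
    text = re.sub(r"\{\{[^{}]*\}\}", "", wikitext)                  # non-nested templates
    text = re.sub(r"\[\[(?:[^|\]]*\|)?([^\]]*)\]\]", r"\1", text)   # [[target|label]] -> label
    text = re.sub(r"'{2,}", "", text)                               # bold/italic quote runs
    return text.strip()


def iter_pages(stream):
    """Yield (title, plain_text) for every <page> element in the dump."""
    title, text = None, None
    for _, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, strip_markup(text or "")
            title, text = None, None
            elem.clear()  # release the memory held by the finished page


if __name__ == "__main__":
    # Hypothetical one-page dump used only to exercise the functions.
    sample = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
      <page>
        <title>Budapest</title>
        <revision>
          <text>'''Budapest''' lies on the [[Danube|river Danube]].{{citation needed}}</text>
        </revision>
      </page>
    </mediawiki>"""
    for page_title, plain in iter_pages(io.BytesIO(sample)):
        print(page_title, "->", plain)
```

`iter_pages` also accepts a file name or an open file object for a real dump, since both are passed straight to `iterparse`.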
ECML/PKDD 2010 Discovery Challenge Data Set
- DMS, András Benczúr, 2010
- This is a large collection of annotated Web hosts labeled by the Hungarian Academy of Sciences (English), European Archive Foundation (French) and L3S Hannover (German), see credits. The base data is a set of 23M pages in 99K hosts in the .EU domain.
- description
- download
LiveJournal data
- DMS, Miklós Kurucz, 2008
- The data set is intended for research purposes only and freely available as per Creative Commons Attribution-Noncommercial-Share Alike 3.0, which basically states that you are free to use the labels and that we make no warranties about them. You can download and use the data for research in any institution public or private. The "nc-sa" (non-commercial, share-alike) rule applies if you want to redistribute the labels publicly.
- description
- download
Temporal Features for Web Spam Detection
- DMS, Miklós Erdélyi, 2011
- Below you can find temporal features for Web spam detection calculated from monthly snapshots of the .uk domain between October 2006 and May 2007. The archives contain files in Weka's ARFF format, one for each snapshot pair, i.e., for October-November 2006, for November-December 2006, etc.
- description
- download
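ARFF files are plain text, so they can be inspected without Weka. The sketch below is a minimal reader written for illustration only: it assumes dense (non-sparse) data and attribute names without spaces or quotes, and the attribute names in the usage example are hypothetical, not taken from the published archives. For real work an established reader such as `scipy.io.arff.loadarff` is likely the better choice.

```python
def _convert(value):
    """Map '?' to None, numeric strings to float, and leave the rest as text."""
    if value == "?":
        return None
    try:
        return float(value)
    except ValueError:
        return value


def read_arff(lines):
    """Parse dense ARFF content into (relation, attribute names, data rows)."""
    relation, attributes, rows = None, [], []
    in_data = False
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("%"):   # skip blank lines and comments
            continue
        if in_data:
            rows.append([_convert(v.strip()) for v in line.split(",")])
            continue
        keyword = line.split(None, 1)[0].lower()
        if keyword == "@relation":
            relation = line.split(None, 1)[1]
        elif keyword == "@attribute":
            attributes.append(line.split(None, 2)[1])
        elif keyword == "@data":
            in_data = True
    return relation, attributes, rows


if __name__ == "__main__":
    # Hypothetical content; real files would be opened with open(path).
    example = """% toy example
@relation demo
@attribute host_id numeric
@attribute label {spam,nonspam}
@data
1,spam
2,?
""".splitlines()
    print(read_arff(example))
```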
Webcorpora and Frequency dictionaries
- HLT, Attila Zséder, Gábor Recski
- Large-scale language resources for fifteen medium density European languages: Catalan, Czech, Croatian, Danish, Dutch, Finnish, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Serbian, Slovak, Spanish, and Swedish
- official site
- download
hunNERwiki
- HLT, Dávid Nemeskey, Eszter Simon
- a silver standard corpus for Hungarian Named Entity Recognition
- official site
- download