Watershed-ng : an extensible distributed stream processing framework.

dc.contributor.author: Rocha, Rodrigo
dc.contributor.author: Hott, Bruno
dc.contributor.author: Dias, Vinícius
dc.contributor.author: Ferreira, Renato
dc.contributor.author: Meira Júnior, Wagner
dc.contributor.author: Guedes Neto, Dorgival Olavo
dc.date.accessioned: 2023-07-03T16:49:26Z
dc.date.available: 2023-07-03T16:49:26Z
dc.date.issued: 2016
dc.description.abstract: Most high-performance data processing (a.k.a. big data) systems allow users to express their computation using abstractions (like MapReduce), which simplify the extraction of parallelism from applications. Most frameworks, however, do not allow users to specify how communication must take place: that element is deeply embedded into the run-time system abstractions, making changes hard to implement. In this work, we describe Watershed-ng, our re-engineering of the Watershed system, a framework based on the filter–stream paradigm and originally focused on continuous stream processing. Like other big-data environments, Watershed provided object-oriented abstractions to express computation (filters), but the implementation of streams was a run-time system element. By isolating stream functionality into appropriate classes, combination of communication patterns and reuse of common message handling functions (like compression and blocking) become possible. The new architecture even allows the design of new communication patterns, for example, allowing users to choose MPI, TCP, or shared memory implementations of communication channels as their problem demands. Applications designed for the new interface showed reductions in code size on the order of 50% and above in some cases. The performance results also showed significant improvements, because some implementation bottlenecks were removed in the re-engineering process.
dc.identifier.citation: ROCHA, R. et al. Watershed-ng: an extensible distributed stream processing framework. Concurrency and Computation, v. 28, p. 2487-2502, Jan. 2016. Available at: <https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.3779>. Accessed: 3 May 2023.
dc.identifier.doi: https://doi.org/10.1002/cpe.3779
dc.identifier.issn: 1532-0634
dc.identifier.uri: http://www.repositorio.ufop.br/jspui/handle/123456789/16842
dc.identifier.uri2: https://onlinelibrary.wiley.com/doi/abs/10.1002/cpe.3779
dc.language.iso: en_US
dc.rights: restricted
dc.subject: Distributed systems
dc.subject: Watershed
dc.subject: Big data
dc.subject: Frameworks
dc.title: Watershed-ng : an extensible distributed stream processing framework.
dc.type: Article published in a journal
Files
Original bundle
Name: ARTIGO_WatershedExtensibleDistributed.pdf
Size: 1.48 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission