ZENITH: Scientific Data Management

Modern sciences such as agronomy, bio-informatics, and environmental science must deal with overwhelming amounts of experimental data. Such data must be processed (cleaned, transformed, analyzed) in many ways in order to draw new conclusions, prove scientific theories and produce knowledge. However, constant progress in scientific observational instruments and simulation tools creates a huge data overload. For example, climate modeling data are growing so fast that collections are expected to reach hundreds of exabytes by 2020. Scientific data are also very complex, in particular because of the heterogeneous methods used for producing data, the uncertainty of captured data, the inherently multi-scale nature of many sciences and the growing use of imaging, resulting in data with hundreds of attributes, dimensions or descriptors. Processing and analyzing such massive sets of complex data is therefore a major challenge, since solutions must combine new data management techniques with large-scale parallelism in cluster, grid or cloud environments.

Scientific data management is now on the agenda of a very active research community composed of scientists from different disciplines and data management researchers. For instance, the SciDB organization is building an open-source database system for scientific data analytics.

The three main challenges of scientific data management can be summarized as:

  1. scale (big data, big applications);
  2. complexity (uncertain, multi-scale data with lots of dimensions);
  3. heterogeneity (in particular, data semantics heterogeneity).

The overall goal of Zenith is to address these challenges by proposing innovative solutions with significant advantages in terms of scalability, functionality, ease of use, and performance. We plan to design and validate our solutions by working closely with scientific application partners. To further validate our solutions and extend the scope of our results, we also want to foster industrial collaborations, even in non-scientific applications, provided that they exhibit similar challenges.

Members

Staff

Associates & Students

Regular Co-workers

  • Hervé Goëau, CIRAD
  • Christophe Pradal, CIRAD

Research Themes

Our approach is to capitalize on the principles of distributed data management. In particular, we plan to exploit: high-level languages as the basis for data independence and automatic optimization; data semantics (taxonomies, folksonomies, ontologies, …) to improve information retrieval and automate data integration; declarative languages (algebra, calculus) to manipulate data and workflows, with user-defined functions; and user (social) profiles and relationships between participants to help recommendation. Furthermore, we will exploit highly distributed environments, in particular P2P for data sharing between participants and parallel processing to scale up in the cloud. To reflect our approach, we organize our research program in three complementary research themes:

  1. Data and Metadata Management. This theme addresses the problems of managing and integrating data and metadata with uncertainty, in particular, uncertain entity resolution and distributed probabilistic query processing.
  2. Data and process sharing. This theme addresses the problems of sharing scientific data and processes in highly distributed and parallel environments, in particular, social-based P2P data sharing, recommendation and scientific workflow management.
  3. Scalable data analysis. Given the gap between the growth of computing power and that of data production, our ability to analyze these data is inevitably at stake. This theme addresses the scalability problem by investigating new data mining and content-based retrieval techniques that exploit parallelism in the cloud.

Major Publications

R. Akbarinia, P. Valduriez, G. Verger, Efficient Evaluation of SUM Queries Over Probabilistic Data. IEEE Transactions on Knowledge and Data Engineering, Vol. 25, No. 4, 764-775, 2013.

M. El Dick, E. Pacitti, R. Akbarinia, B. Kemme, Building a Peer-to-Peer Content Distribution Network with High Performance, Scalability and Robustness, Information Systems, Vol. 36, No 2, p. 222-247, 2011.

P. Letessier, O. Buisson, A. Joly, N. Boujemaa, Scalable Mining of Small Visual Objects, ACM Multimedia Conf., 2012.

E. Ogasawara, D. De Oliveira, P. Valduriez, J. Dias, F. Porto, M. Mattoso, An Algebraic Approach for Data-Centric Scientific Workflows, Proceedings of VLDB, Vol. 4, No 11, p. 1328-1339, 2011.

F. Petitjean, F. Masseglia, P. Gançarski, G. Forestier, Discovering Significant Evolution Patterns from Satellite Image Time Series, International Journal of Neural Systems, Vol. 21, No 6, 475-489, 2011.

Complete list of publications

Publications 2014 - 2019: Evaluation period

International Journals

2019

  1. Keeping Track of User Steering Actions in Dynamic Workflows
    Renan Souza, Vítor Silva, José Camata, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2019, 99, pp.624-643.
  2. HydroShoot: a functional-structural plant model for simulating hydraulic structure, gas and energy exchange dynamics of complex plant canopies under water deficit - application to grapevine (Vitis vinifera L.)
    R. Albasha, C. Fournier, C Pradal, M Chelle, Jorge Alejandro Prieto, G. Louarn, T Simonneau, E Lebon
    in silico Plants, Oxford Academic, 2019. <10.1093/insilicoplants/diz007>
  3. Changes in the vertical distribution of leaf area enhanced light interception efficiency in maize over generations of selection
    Raphael Perez, Christian Fournier, Llorenç Cabrera-Bosquet, Simon Artzet, Christophe Pradal, Nicolas Brichet, Tsu-Wei Chen, Romain Chapuis, Claude Welcker, Francois Tardieu
    Plant, Cell and Environment, Wiley, 2019, 42 (7), pp.2105-2119.
  4. Genetic and environmental dissection of biomass accumulation in multi-genotype maize canopies
    Tsu-Wei Chen, Llorenç Cabrera-Bosquet, Santiago Alvarez Prado, Raphael Perez, Simon Artzet, Christophe Pradal, Aude Coupel-Ledru, Christian Fournier, Francois Tardieu
    Journal of Experimental Botany, Oxford University Press (OUP), 2019, 70 (9), pp.2523-2534.
  5. Current knowledge and future research opportunities for modeling annual crop mixtures. A review
    Noémie Gaudio, Abraham Escobar-Gutiérrez, Pierre Casadebaig, Jochem Evers, Frederic Gerard, Gaëtan Louarn, Nathalie Colbach, Sebastian Munz, Marie Launay, Hélène Marrou, Romain Barillot, Philippe Hinsinger, Jacques-Eric Bergez, Didier Combes, Jean-Louis Durand, Ela Frak, Loic Pagès, Christophe Pradal, Sébastien Saint-Jean, Wopke van der Werf, Eric Justes
    Agronomy for Sustainable Development, Springer Verlag/EDP Sciences/INRA, 2019, 39 (2). <10.1007/s13593-019-0562-6>
  6. Toward a large-scale and deep phenological stage annotation of herbarium specimens: Case studies from temperate, tropical, and equatorial floras
    Titouan Lorieul, Katelin Pearson, Elizabeth Ellwood, Hervé Goëau, Jean‐francois Molino, Patrick Sweeney, Jennifer Yost, Joel Sachs, Gil Nelson, Pamela Soltis, Pierre Bonnet, Alexis Joly, Erick Mata-Montero
    Applications in Plant Sciences, Wiley, 2019, 7 (3), pp.e01233.
  7. CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning
    Fabian Robert-Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël Habets
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2019, 27 (2), pp.268-282.
  8. Massively Distributed Time Series Indexing and Querying
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Themis Palpanas
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, 2019, pp.1-14.
  9. Privacy-Preserving Top-k Query Processing in Distributed Systems
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2019.
  10. VersionClimber: version upgrades without tears
    Christophe Pradal, Sarah Cohen-Boulakia, Patrick Valduriez, Dennis Shasha
    Computing in Science & Engineering, IEEE, In press, 21 (5), pp.87-93.
  11. MuSCA: a multi-scale source-sink carbon allocation model to explore carbon allocation in plants. An application on static apple-tree
    Francesco Reyes, Benoit Pallas, Christophe Pradal, Federico Vaggi, D Zanotelli, Tagliavini Marco, D. Gianelle, Evelyne Costes
    Annals of Botany, Oxford University Press (OUP), In press.
  12. Parallel Computation of PDFs on Big Spatial Data Using Spark
    Ji Liu, Noel Moreno Lemus, Esther Pacitti, Fábio Porto, Patrick Valduriez
    Distributed and Parallel Databases, Springer, In press, pp.1-38.
  13. Musical Source Separation: An Introduction
    Estefania Cano, Derry Fitzgerald, Antoine Liutkus, Mark Plumbley, Fabian Robert-Stöter
    IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2019, 36 (1), pp.31-40.

2018

  1. ParCorr: efficient parallel methods to identify similar time series pairs across sliding windows
    Djamel-Edine Yagoubi, Reza Akbarinia, Boyan Kolev, Oleksandra Levchenko, Florent Masseglia, Patrick Valduriez, Dennis Shasha
    Data Mining and Knowledge Discovery, Springer, 2018, 32 (5), pp.1481-1507.
  2. DfAnalyzer: Runtime Dataflow Analysis of Scientific Applications using Provenance
    Vítor Silva, Daniel de Oliveira, Patrick Valduriez, Marta Mattoso
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2018, 11 (12), pp.2082-2085.
  3. AutoWIG: automatic generation of python bindings for C++ libraries
    Pierre Fernique, Christophe Pradal
    PeerJ Computer Science, PeerJ, 2018, 4, pp.e149.
  4. Non-parametric Bayesian annotator combination
    Maximilien Servajean, Romain Chailan, Alexis Joly
    Information Sciences, Elsevier, 2018, 436-437, pp.131-145.
  5. Species distribution modeling based on the automated identification of citizen observations
    Christophe Botella, Alexis Joly, Pierre Bonnet, Pascal Monestiez, François Munoz
    Applications in Plant Sciences, Wiley, 2018, Green Digitization: Online Botanical Collections Data Answering Real‐World Questions, 6 (2), pp.1-11.
  6. Distributed Management of Scientific Workflows for High-Throughput Plant Phenotyping
    Christophe Pradal, Sarah Cohen-Boulakia, Gaetan Heidsieck, Esther Pacitti, Francois Tardieu, Patrick Valduriez
    ERCIM News, ERCIM, 2018, Smart Farming, pp.36-37.
  7. In situ visualization and data analysis for turbidity currents simulation
    José Camata, Vitor Silva, Patrick Valduriez, Marta Mattoso, Alvaro Coutinho
    Computers & Geosciences, Elsevier, 2018, 110, pp.23-31.
  8. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
    Ji Liu, Luis Pineda, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, In press. <10.1109/TKDE.2018.2867857>
  9. A Survey of Scheduling Frameworks in Big Data Systems
    Ji Liu, Esther Pacitti, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2018, 7 (2), pp.103-128.
  10. An Overview of Lead and Accompaniment Separation in Music
    Zafar Rafii, Antoine Liutkus, Fabian Robert-Stöter, Stylianos Ioannis Mimilakis, Derry Fitzgerald, Bryan Pardo
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018, 26 (8), pp.1307-1335.

2017

  1. Going deeper in the automated identification of Herbarium specimens
    Jose Carranza-Rojas, Hervé Goëau, Pierre Bonnet, Erick Mata-Montero, Alexis Joly
    BMC Evolutionary Biology, BioMed Central, 2017, 17 (1), pp.181.
  2. A robot-assisted imaging pipeline for tracking the growths of maize ear and silks in a high-throughput phenotyping platform
    Nicolas Brichet, Christian Fournier, Olivier Turc, Olivier Strauss, Simon Artzet, Christophe Pradal, Claude Welcker, Francois Tardieu, Llorenç Cabrera-Bosquet
    Plant Methods, BioMed Central, 2017, 13 (1), pp.12.
  3. Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities
    Sarah Cohen-Boulakia, Khalid Belhajjame, Olivier Collin, Jérôme Chopard, Christine Froidevaux, Alban Gaignard, Konrad Hinsen, Pierre Larmande, Yvan Le Bras, Frédéric Lemoine, Fabien Mareuil, Hervé Ménager, Christophe Pradal, Christophe Blanchet
    Future Generation Computer Systems, Elsevier, 2017, 75, pp.284-298.
  4. Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2017, 19 (6), pp.1376-1391.
  5. InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid
    Christophe Pradal, Simon Artzet, Jerome Chopard, Dimitri Dupuis, Christian Fournier, Michael Mielewczik, Vincent Negre, Pascal Neveu, Didier Parigot, Patrick Valduriez, Sarah Cohen-Boulakia
    Future Generation Computer Systems, Elsevier, 2017, 67, pp.341-353.
  6. A Highly Scalable Parallel Algorithm for Maximally Informative k-Itemset Mining
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 50 (1), pp.1-26.
  7. Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2017, 33, pp.80-112.
  8. Data reduction in scientific workflows using provenance monitoring and user steering
    Renan Souza, Vitor Silva, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, In press. <10.1016/j.future.2017.11.028>
  9. Raw data queries during data-intensive parallel workflow execution
    Vítor Silva, José Leite, José Camata, Daniel de Oliveira, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2017, 75, pp.402-422.
  10. Data placement in massively distributed environments for fast parallel mining of frequent itemsets
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 53 (1), pp.207-237.

2016

  1. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language
    Boyan Kolev, Patrick Valduriez, Carlyna Bondiombouy, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    Distributed and Parallel Databases, Springer, 2016, 34 (4), pp.463-503.
  2. AgroLD API. Une architecture orientée services pour l'extraction de connaissances dans la base de données liées AgroLD
    Gildas Tagny Ngompe, Aravind Venkatesan, Nordine El Hassouni, Manuel Ruiz, Pierre Larmande
    Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information, Lavoisier, 2016, 21 (5-6), pp.133-158.
  3. Categorizing plant images at the variety level: Did you say fine-grained?
    Julien Champ, Titouan Lorieul, Pierre Bonnet, Najate Maghnaoui, Christophe Sereno, Thierry Dessup, Jean-Michel Boursiquot, Laurent Audeguin, Thierry Lacombe, Alexis Joly
    Pattern Recognition Letters, Elsevier, 2016, 81, pp.71-79.
  4. Database System Support of Simulation Data
    Hermano Lustosa, Fabio Porto, Pablo Blanco, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2016, 9 (13), pp.1329-1340.
  5. Gigwa—Genotype investigator for genome-wide analyses
    Guilhem Sempéré, Florian Philippe, Alexis Dereeper, Manuel Ruiz, Gautier Sarah, Pierre Larmande
    GigaScience, BioMed Central, 2016, 5 (1). <10.1186/s13742-016-0131-8>
  6. Social Networks and Information Retrieval, How Are They Converging? A Survey, a Taxonomy and an Analysis of Social Information Retrieval Approaches and Platforms
    Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub
    Information Systems, Elsevier, 2016, 56, pp.1-18.
  7. Guest Editorial: Environmental Multimedia Retrieval
    Stefanos Vrochidis, Kostas D. Karatzas, Ari Karppinen, Alexis Joly
    Multimedia Tools and Applications, Springer Verlag, 2016, 75 (3), pp.1557-1562.
  8. A look inside the Pl@ntNet experience
    Alexis Joly, Pierre Bonnet, Hervé Goëau, Julien Barbe, Souheil Selmi, Julien Champ, Samuel Dufour-Kowalski, Antoine Affouard, Jennifer Carré, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Multimedia Systems, Springer Verlag, 2016, 22 (6), pp.751-766.
  9. Multistore Big Data Integration with CloudMdsQL
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2016, 28, pp.48-74.
  10. Query processing in multistore systems: an overview
    Carlyna Bondiombouy, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2016, pp.38.
  11. Analyzing Related Raw Data Files through Dataflows
    Vitor Silva, Daniel De Oliveira, Patrick Valduriez, Marta Mattoso
    Concurrency and Computation: Practice and Experience, Wiley, 2016, 28 (8), pp.2528-2545.
  12. Multi-Objective Scheduling of Scientific Workflows in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel de Oliveira, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2016, 63, pp.76-95.
  13. Effective and Efficient Similarity Search in Scientific Workflow Repositories
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    Future Generation Computer Systems, Elsevier, 2016, 56, pp.584-594.
  14. FP-Hadoop: Efficient Processing of Skewed MapReduce Jobs
    Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Patrick Valduriez
    Information Systems, Elsevier, 2016, 60, pp.69-84.
  15. Plant identification: Man vs. Machine
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Julien Champ, Christel Vignau, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2016, LifeCLEF 2014 plant identification challenge, 75 (3), pp.1647-1665.

2015

  1. A Survey of Data-Intensive Scientific Workflow Management
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Journal of Grid Computing, Springer Verlag, 2015, 13 (4), pp.457-493.
  2. Increasing Coverage in Distributed Search and Recommendation with Profile Diversity
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2015, LNCS (9430), pp.115-144.
  3. Rank aggregation with ties: Experiments and Analysis
    Bryan Brancotte, Bo Yang, Guillaume Blin, Sarah Cohen-Boulakia, Alain Denise, Sylvie Hamel
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, 8 (11), pp.1202-1213.
  4. Profile Diversity for Query Processing using User Recommendations
    Maximilien Servajean, Reza Akbarinia, Esther Pacitti, Sihem Amer-Yahia
    Information Systems, Elsevier, 2015, 48, pp.44-63.
  5. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
    Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, 8 (12), pp.1856-1867.
  6. Data-Centric Iteration in Dynamic Workflows
    Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2015, 46, pp.114-126.

2014

  1. Special section on data-intensive cloud infrastructure
    Ashraf Aboulnaga, Beng Chin Ooi, Patrick Valduriez
    The VLDB Journal, Springer, 2014, 23 (6), pp.843-843.
  2. Autonomic Intrusion Detection: Adaptively Detecting Anomalies over Unlabeled Audit Data Streams in Computer Networks
    Wei Wang, Thomas Guyet, René Quiniou, Marie-Odile Cordier, Florent Masseglia, Xiangliang Zhang
    Knowledge-Based Systems, Elsevier, 2014, 70, pp.103-117.
  3. The anti-bouncing data stream model for web usage streams with intralinkings
    Chongsheng Zhang, Florent Masseglia, Yves Lechevallier
    Information Sciences, Elsevier, 2014, 278, pp.757-772.
  4. Evaluation of Direct Manipulation using Finger Tracking for Complex Tasks in an Immersive Cube
    Emmanuelle Chapoulie, Maud Marchal, Evanthia Dimara, Maria Roussou, Jean-Christophe Lombardo, George Drettakis
    Virtual Reality, Springer Verlag, 2014, 18 (3), pp.203-217.
  5. Similarity Search for Scientific Workflows
    Johannes Starlinger, Bryan Brancotte, Sarah Cohen-Boulakia, Ulf Leser
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2014, 7 (12), pp.1143-1154.
  6. Query Reformulation in PDMS Based on Social Relevance
    Angela Bonifati, Gianvito Summa, Esther Pacitti, Fady Draidi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, Transactions on Large-Scale Data- and Knowledge-Centered Systems XIII, LNCS, pp.59-90.
  7. Interactive plant identification based on social image data
    Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Julien Barbe, Souheil Selmi, Itheri Yahiaoui, Jennifer Carré, Elise Mouysset, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Ecological Informatics, Elsevier, 2014, 23, pp.22-34.
  8. Entity Resolution for Probabilistic Data
    Ayat Naser, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Information Sciences, Elsevier, 2014, 277, pp.492-511.
  9. Object-based visual query suggestion
    Amel Hamzaoui, Pierre Letessier, Alexis Joly, Olivier Buisson, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2014, 68 (2), pp.429-454.

International Communications

2019

  1. SAVIME: A Database Management System for Simulation Data Analysis and Visualization
    Hermano Lustosa, Fabio Porto, Patrick Valduriez
    SBBD: Simpósio Brasileiro de Banco de Dados, SBC, Oct 2019, Fortaleza, Brazil. pp.1-12.
  2. Efficient Runtime Capture of Multiworkflow Data Using Provenance
    Renan Souza, Leonardo Azevedo, Raphael Thiago, Elton Soares, Marcelo Nery, Marco Netto, Emilio Vital Brazil, Renato Cerqueira, Patrick Valduriez, Marta Mattoso
    eScience 2019 : 15th International eScience Conference, Sep 2019, San Diego, United States. pp.1-10.
  3. Querying Key-Value Stores under Single-Key Constraints: Rewriting and Parallelization
    Olivier Rodriguez, Reza Akbarinia, Federico Ulliana
    RuleML+RR, Sep 2019, Bolzano, Italy. <https://rulemlrr19.inf.unibz.it/>
  4. Distributed Algorithms to Find Similar Time Series
    Oleksandra Levchenko, Boyan Kolev, Djamel-Edine Yagoubi, Dennis Shasha, Themis Palpanas, Patrick Valduriez, Reza Akbarinia, Florent Masseglia
    ECML-PKDD : European Conference on Machine Learning and Knowledge Discovery in Databases, Sep 2019, Würzburg, Germany.
  5. Overview of GeoLifeCLEF 2019: plant species prediction using environment and animal occurrences
    Christophe Botella, Maximilien Servajean, Pierre Bonnet, Alexis Joly
    CLEF 2019 - Conference and Labs of the Evaluation Forum, Sep 2019, Lugano, Switzerland. <http://clef2018.clef-initiative.eu/>
  6. Overview of LifeCLEF 2019: Identification of Amazonian Plants, South & North American Birds, and Niche Prediction
    Alexis Joly, Hervé Goëau, Christophe Botella, Stefan Kahl, Maximilien Servajean, Hervé Glotin, Pierre Bonnet, Robert Planqué, Fabian Robert-Stöter, Willem-Pier Vellinga, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2019, Lugano, Switzerland. pp.387-401.
  7. Overview of LifeCLEF Plant Identification task 2019: diving into data deficient tropical countries
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF 2019: Conference and Labs of the Evaluation Forum, Linda Cappellato; Nicola Ferro; David E. Losada; Henning Müller, Sep 2019, Lugano, Switzerland. pp.1-13.
  8. Adaptive Caching for Data-Intensive Scientific Workflows in the Cloud
    Gaetan Heidsieck, Daniel de Oliveira, Esther Pacitti, Christophe Pradal, Francois Tardieu, Patrick Valduriez
    DEXA 2019 - 30th International Conference on Database and Expert Systems Applications, Aug 2019, Linz, Austria.
  9. Sliced-Wasserstein Flows: Nonparametric Generative Modeling via Optimal Transport and Diffusions
    Antoine Liutkus, Umut Şimşekli, Szymon Majewski, Alain Durmus, Fabian Robert-Stöter
    International Conference on Machine Learning, Jun 2019, Long Beach, California, United States.
  10. Speech enhancement with variational autoencoders and alpha-stable distributions
    Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud
    ICASSP 2019 - International Conference on Acoustics Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545.
  11. Parallel Streaming Implementation of Online Time Series Correlation Discovery on Sliding Windows with Regression Capabilities
    Boyan Kolev, Reza Akbarinia, Ricardo Jimenez-Peris, Oleksandra Levchenko, Florent Masseglia, Marta Patino, Patrick Valduriez
    CLOSER : International Conference on Cloud Computing and Services Science, May 2019, Heraklion, Greece. pp.681-687.
  12. LifeCLEF 2019: Biodiversity Identification and Prediction Challenges
    Alexis Joly, Hervé Goëau, Christophe Botella, Stefan Kahl, Marion Poupard, Maximilien Servajean, Hervé Glotin, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Jan Schlüter, Fabian Robert-Stöter, Henning Müller
    41st European Conference on IR Research, ECIR 2019, Apr 2019, Cologne, Germany. pp.275-282.
  13. Dirichlet Process Mixture Models made Scalable and Effective by means of Massive Distribution
    Khadidja Meguelati, Bénédicte Fontez, Nadine Hilgert, Florent Masseglia
    SAC: Symposium on Applied Computing, Apr 2019, Limassol, Cyprus. pp.502-509.

2018

  1. Parallel Polyglot Query Processing on Heterogeneous Cloud Data Stores with LeanXcale
    Boyan Kolev, Oleksandra Levchenko, Esther Pacitti, Patrick Valduriez, Ricardo Vilaça, Rui Gonçalves, Ricardo Jiménez-Peris, Pavlos Kranas
    IEEE BigData, Dec 2018, Seattle, United States. pp.10.
  2. Spark-parSketch: A Massively Distributed Indexing of Time Series Datasets
    Oleksandra Levchenko, Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Boyan Kolev, Dennis Shasha
    CIKM: Conference on Information and Knowledge Management, Oct 2018, Turin, Italy. pp.1951-1954.
  3. SiSEC 2018: State of the art in musical audio source separation - subjective selection of the best algorithm
    Dominic Ward, Russel D. Mason, Chungeun Kim, Fabian Robert-Stöter, Antoine Liutkus, Mark Plumbley
    WIMP: Workshop on Intelligent Music Production, Sep 2018, Huddersfield, United Kingdom. <http://epubs.surrey.ac.uk/id/eprint/849086>
  4. Overview of LifeCLEF 2018: A Large-Scale Evaluation of Species Identification and Recommendation Algorithms in the Era of AI
    Alexis Joly, Hervé Goëau, Christophe Botella, Hervé Glotin, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2018, Avignon, France. pp.247-266.
  5. Overview of BirdCLEF 2018: monospecies vs. soundscape bird identification
    Hervé Goëau, Stefan Kahl, Hervé Glotin, Robert Planqué, Willem-Pier Vellinga, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. <http://ceur-ws.org/Vol-2125/invited_paper_9.pdf>
  6. Location-based species recommendation using co-occurrences and environment-GeoLifeCLEF 2018 challenge
    Benjamin Deneu, Maximilien Servajean, Christophe Botella, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. <http://ceur-ws.org/Vol-2125/paper_119.pdf>
  7. Overview of GeoLifeCLEF 2018: location-based species recommendation
    Christophe Botella, Pierre Bonnet, François Munoz, Pascal Monestiez, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. <http://ceur-ws.org/Vol-2125/invited_paper_8.pdf>
  8. Overview of ExpertLifeCLEF 2018: how far automated identification systems are from the best experts?
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. <http://ceur-ws.org/Vol-2125/invited_paper_10.pdf>
  9. Answering Top-k Queries over Outsourced Sensitive Data in the Cloud
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    DEXA: Database and Expert Systems Applications, Sep 2018, Regensburg, Germany. pp.218-231.
  10. Discovering Tight Space-Time Sequences
    Riccardo Campisano, Heraldo Borges, Fábio Porto, Fabio Perosi, Esther Pacitti, Florent Masseglia, Eduardo Ogasawara
    DaWaK: Data Warehousing and Knowledge Discovery, Sep 2018, Regensburg, Germany. pp.247-257.
  11. Privacy-Preserving Top-k Query Processing in Distributed Systems
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    Euro-Par: European Conference on Parallel and Distributed Computing, Aug 2018, Turin, Italy. pp.281-292.
  12. Computation of PDFs on Big Spatial Data: Problem & Architecture
    Ji Liu, Noel Lemus, Esther Pacitti, Fábio Porto, Patrick Valduriez
    LADaS: Latin America Data Science Workshop, Aug 2018, Rio de Janeiro, Brazil. pp.6.
  13. Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project
    Patrick Valduriez, Marta Mattoso, Reza Akbarinia, Heraldo Borges, José Camata, Alvaro Coutinho, Daniel Gaspar, Noel Lemus, Ji Liu, Hermano Lustosa, Florent Masseglia, Fabricio Nogueira da Silva, Vitor Silva, Renan Souza, Kary Ocaña, Eduardo Ogasawara, Daniel Oliveira, Esther Pacitti, Fábio Porto, Dennis Shasha
    LADaS: Latin America Data Science Workshop, Aug 2018, Rio de Janeiro, Brazil. <http://ceur-ws.org/Vol-2170>
  14. FReeP: towards parameter recommendation in scientific workflows using preference learning
    Daniel Silva, Aline Paes, Esther Pacitti, Daniel de Oliveira
    SBBD: Simpósio Brasileiro de Banco de Dados, Aug 2018, Rio de Janeiro, Brazil. pp.211-216.
  15. Rumo à Integração da Álgebra de Workflows com o Processamento de Consulta Relacional
    João Ferreira, Jorge Soares, Fábio Porto, Esther Pacitti, Rafaelli Coutinho, Eduardo Ogasawara
    SBBD: Simpósio Brasileiro de Banco de Dados, SBC, Aug 2018, Rio de Janeiro, Brazil. pp.205-210.
  16. Detecção de Anomalias Frequentes no Transporte Rodoviário Urbano
    Ana Cruz, João Ferreira, Diego Carvalho, Eduardo Mendes, Esther Pacitti, Rafaelli Coutinho, Fábio Porto, Eduardo Ogasawara
    SBBD: Simpósio Brasileiro de Banco de Dados, SBC, Aug 2018, Rio de Janeiro, Brazil. pp.271-276.
  17. Constellation Queries over Big Data
    Fábio Porto, Amir Khatibi, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
    SBBD: Simpósio Brasileiro de Banco de Dados, SBC, Aug 2018, Rio de Janeiro, Brazil. pp.85-96.
  18. Point Pattern Search in Big Data
    Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Alberto Krone-Martins, Patrick Valduriez, Dennis Shasha
    SSDBM: Scientific and Statistical Database Management, Jul 2018, Bozen-Bolzano, Italy. pp.#21.
  19. The 2018 Signal Separation Evaluation Campaign
    Fabian Robert-Stöter, Antoine Liutkus, Nobutaka Ito
    LVA/ICA: Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.293-305.
  20. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
    Mathieu Fontaine, Fabian Robert-Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
    LVA/ICA: Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.13-23.
  21. A Distributed Collaborative Filtering Algorithm Using Multiple Data Sources
    Mohamed Bouadjenek, Esther Pacitti, Maximilien Servajean, Florent Masseglia, Amr Abbadi
    DBKDA: Advances in Databases, Knowledge, and Data Applications, May 2018, Nice, France. <https://www.iaria.org/conferences2018/DBKDA18.html>
  22. A Differentially Private Index for Range Query Processing in Clouds
    Cetin Sahin, Tristan Allard, Reza Akbarinia, Amr El Abbadi, Esther Pacitti
    ICDE: International Conference on Data Engineering, Apr 2018, Paris, France. pp.857-868.
  23. Alpha-stable low-rank plus residual decomposition for speech enhancement
    Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gaël Richard
    ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.651-655.
  24. Blind Source Separation Using Mixtures of Alpha-Stable Distributions
    Nicolas Keriven, Antoine Deleforge, Antoine Liutkus
    ICASSP: International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.771-775.
  25. Interference reduction on full-length live recordings
    Diego Di Carlo, Antoine Liutkus, Ken Déguernel
    ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.736-740.
  26. Audio source separation with magnitude priors: the BEADS model
    Antoine Liutkus, Christian Rohlfing, Antoine Deleforge
    ICASSP: International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.56-60.
  27. Maximally Informative k-Itemset Mining from Massively Distributed Data Streams
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    SAC: Symposium on Applied Computing, Apr 2018, Pau, France. pp.502-509.
  28. The role of hydraulics FSPMs in the context of root breeding : a case study on Pearl Millet
    Adama Ndour, Christophe Pradal, Vincent Vadez, Sixtine Passot, Yann Guédon, Laurent Laplaze, Mikael Lucas
    EGU: European Geosciences Union, Apr 2018, Vienna, Austria. pp.2018-19792.

2017

  1. DPiSAX: Massively Distributed Partitioned iSAX
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Themis Palpanas
    ICDM: International Conference on Data Mining, Nov 2017, New Orleans, United States. pp.1135-1140.
  2. Querying Key-Value Stores Under Simple Semantic Constraints : Rewriting and Parallelization
    Olivier Rodriguez, Corentin Colomier, Cecilie Rivière, Reza Akbarinia, Federico Ulliana
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. <https://project.inria.fr/bda2017/>
  3. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. pp.1-15.
  4. End-to-end Graph Mapper
    Benjamin Billet, Mickaël Jurret, Didier Parigot, Patrick Valduriez
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. <https://project.inria.fr/bda2017/>
  5. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
    Ji Liu, Luis Pineda-Morales, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. <https://project.inria.fr/bda2017/>
  6. Tracking of Online Parameter Fine-tuning in Scientific Workflows
    Renan Souza, Vitor Silva, José Camata, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    WORKS: Workflows in Support of Large-scale Science, Nov 2017, Denver, United States.
  7. TARS: An Array Model with Rich Semantics for Multidimensional Data
    Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez
    Forum and Demos at ER, Nov 2017, Valencia, Spain. pp.114-127.
  8. Pl@ntNet - My Business
    Alexis Joly, Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Hervé Goëau
    MM: Multimedia, Oct 2017, Mountain View, United States. pp.1-11.
  9. RadiusSketch: Massively Distributed Indexing of Time Series
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Dennis Shasha
    DSAA: Data Science and Advanced Analytics, Oct 2017, Tokyo, Japan. pp.1-10.
  10. Spark Scalability Analysis in a Scientific Workflow
    Renan Souza, Vitor Silva, Pedro Miranda, Alexandre Lima, Patrick Valduriez, Marta Mattoso
    SBBD: Simpósio Brasileiro de Banco de Dados, Oct 2017, Uberlandia, Brazil. pp.1-6.
  11. Automated Herbarium Specimen Identification using Deep Learning
    Jose Carranza-Rojas, Alexis Joly, Pierre Bonnet, Hervé Goëau, Erick Mata-Montero
    TDWG: Biodiversity Information Standards, Oct 2017, Ottawa, Canada. pp.e20302.
  12. LifeCLEF 2017 Lab Overview: Multimedia Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Jean-Christophe Lombardo, Robert Planqué, Simone Palazzo, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2017, Dublin, Ireland. pp.255-274.
  13. Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017)
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. <http://ceur-ws.org/Vol-1866/invited_paper_9.pdf>
  14. LifeCLEF Bird Identification Task 2017
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. <http://ceur-ws.org/Vol-1866/invited_paper_8.pdf>
  15. TARDIS: Optimal Execution of Scientific Workflows in Apache Spark
    Daniel Gaspar, Fabio Porto, Reza Akbarinia, Esther Pacitti
    DaWaK: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. pp.74-87.
  16. Pre-processing and Indexing techniques for Constellation Queries in Big Data
    Amir Khatibi, Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
    DaWaK: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. pp.164-172.
  17. Going deeper in the automated identification of Herbarium specimens
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Jean-Christophe Lombardo, Antoine Affouard, Sen Wang, Rémi Knaff, Jean-François Molino, Daniel Barthélémy
    Botany - Botanical Crossroads, Jun 2017, Fort Worth, TX, United States. <http://2017.botanyconference.org/engine/search/index.php?func=detail&aid=83>
  18. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    CAiSE: Advanced Information Systems Engineering, Jun 2017, Essen, Germany. pp.231-246.
  19. Pl@ntNet app in the era of deep learning
    Antoine Affouard, Hervé Goëau, Pierre Bonnet, Jean-Christophe Lombardo, Alexis Joly
    ICLR: International Conference on Learning Representations, Apr 2017, Toulon, France. <https://www.iclr.cc/archive/www/2017.html>

2016

  1. Benchmarking Polystores: the CloudMdsQL Experience
    Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira
    Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, Dec 2016, Washington, DC, United States. pp.2574-2579.
  2. Managing Hot Metadata for Scientific Workflows on Multisite Clouds
    Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso
    Big Data, Dec 2016, Washington, DC, United States. pp.390-397.
  3. Demonstration of the CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. <https://bda2016.ensma.fr/>
  4. Privacy Preserving Query Processing in the Cloud
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. <https://bda2016.ensma.fr>
  5. Mining Maximally Informative k-Itemsets in Massively Distributed Environments
    Saber Salah, Reza Akbarinia, Florent Masseglia
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. <https://bda2016.ensma.fr>
  6. Extending CloudMdsQL with MFR for Big Data Integration
    Carlyna Bondiombouy, Boyan Kolev, Patrick Valduriez, Oleksandra Levchenko
    BDA: Gestion de Données — Principes, Technologies et Applications, LIAS / ISAE-ENSMA, Poitiers, Nov 2016, Poitiers, France. <https://bda2016.ensma.fr>
  7. Scientific Workflow Execution with Multiple Objectives in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel de Oliveira, Marta Mattoso
    BDA: Gestion de Données — Principes, Technologies et Applications, LIAS / ISAE-ENSMA, Poitiers, Nov 2016, Poitiers, France. <https://bda2016.ensma.fr>
  8. Online Input Data Reduction in Scientific Workflows
    Renan Souza, Vítor Silva, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    WORKS: Workflows in Support of Large-scale Science, Nov 2016, Salt Lake City, United States. <http://works.cs.cardiff.ac.uk>
  9. Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet
    Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet
    MM: Conference on Multimedia, Oct 2016, Amsterdam, Netherlands. pp.958-967.
  10. ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    MM: Conference on Multimedia, Oct 2016, Amsterdam, Netherlands. pp.720-721.
  11. Plant Identification in an Open-world (LifeCLEF 2016)
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2016, Évora, Portugal. pp.428-439.
  12. LifeCLEF Bird Identification Task 2016: The arrival of Deep learning
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2016, Évora, Portugal. pp.440-449.
  13. Floristic participation at LifeCLEF 2016 Plant Identification Task
    Julien Champ, Hervé Goëau, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2016, Évora, Portugal. pp.450-458.
  14. LifeCLEF 2016: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Julien Champ, Robert Planqué, Simone Palazzo, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2016, Évora, Portugal. pp.286-310.
  15. Unsupervised Individual Whales Identification: Spot the Difference in the Ocean
    Alexis Joly, Jean-Christophe Lombardo, Julien Champ, Anjara Saloma
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2016, Évora, Portugal. pp.469-480.
  16. Enhancing Energy Production with Exascale HPC Methods
    José Camata, José Cela, Danilo Costa, Alvaro L. G. A. Coutinho, Daniel Fernández-Galisteo, Carmen Jimenez, Vadim Kourdioumov, Marta Mattoso, Rafael Mayo-García, Thomas Miras, José Moríñigo, Jorge Navarro, Philippe Navaux, Daniel de Oliveira, Manuel Rodríguez-Pascual, Vítor Silva, Renan Souza, Patrick Valduriez
    CARLA: Latin American High Performance Computing Conference, Aug 2016, Mexico City, Mexico. pp.233-246.
  17. Scientific Workflow Scheduling with Provenance Support in Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    VECPAR: Vector and Parallel Processing, Faculty of Engineering of the University of Porto, Portugal, Jun 2016, Porto, Portugal. pp.206-219.
  18. The CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    ACM SIGMOD, Jun 2016, San Francisco, United States. <10.1145/2882903.2899400>
  19. Development of a knowledge system for Big Data: Case study to plant phenotyping data
    Luyen Le Ngoc, Anne Tireau, Aravind Venkatesan, Pascal Neveu, Pierre Larmande
    WIMS: Web Intelligence, Mining and Semantics, Mines Ales, Jun 2016, Nimes, France. <10.1145/2912845.2912869>
  20. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    IN-OLIVE, Jun 2016, Montpellier, France.
  21. Spatially Localized Visual Dictionary Learning
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    ICMR: International Conference on Multimedia Retrieval, Jun 2016, New York, United States. pp.367-370.
  22. A New Privacy-Preserving Solution for Clustering Massively Distributed Personal Times-Series
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    ICDE: International Conference on Data Engineering, May 2016, Helsinki, Finland. pp.1370-1373.
  23. Design and Implementation of the CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    CLOSER: Cloud Computing and Services Science, Apr 2016, Rome, Italy. pp.352-359.

2015

  1. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    SWAT4LS: Semantic Web Applications and Tools for Life Sciences, Dec 2015, Cambridge, United Kingdom. <http://ceur-ws.org/Vol-1546/>
  2. Managing Simulation Data with Multidimensional Arrays
    Hermano Lustosa, Fabio Porto, Ramon Costa, Pablo Blanco, Patrick Valduriez
    SBBD: Simpósio Brasileiro de Banco de Dados, Centro Federal de Educação Tecnológica Celso Suckow da Fonseca (CEFET-RJ), Brazil; Laboratório Nacional de Computação Científica (LNCC), Oct 2015, Petrópolis, Brazil. <http://dexl.lncc.br/sbbd2015/>
  3. Query Processing in Cloud Multistore Systems
    Carlyna Bondiombouy
    BDA: Gestion de Données — Principes, Technologies et Applications, Sep 2015, Île de Porquerolles, France. <http://bda2015.univ-tln.fr>
  4. Ontology-based services and knowledge management in the Agronomic Domain
    Pierre Larmande
    RDA: Research Data Alliance, Sep 2015, Paris, France. <https://rd-alliance.org/plenary-meetings/rda-sixth-plenary-meeting.html>
  5. Shared nearest neighbors match kernel for bird songs identification - LifeCLEF 2015 challenge
    Alexis Joly, Valentin Leveau, Julien Champ, Olivier Buisson
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2015, Toulouse, France. <http://ceur-ws.org/Vol-1391/138-CR.pdf>
  6. A comparative study of fine-grained classification methods in the context of the LifeCLEF plant identification challenge 2015
    Julien Champ, Titouan Lorieul, Maximilien Servajean, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2015, Toulouse, France. <http://ceur-ws.org/Vol-1391/30-CR.pdf>
  7. LifeCLEF 2015: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Simone Palazzo, Bob Fisher, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2015, Toulouse, France. pp.462-483.
  8. LifeCLEF Plant Identification Task 2015
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2015, Toulouse, France. <http://ceur-ws.org/Vol-1391/157-CR.pdf>
  9. LifeCLEF Bird Identification Task 2015
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2015, Toulouse, France. <http://ceur-ws.org/Vol-1391/156-CR.pdf>
  10. Integrating Big Data and Relational Data with a Functional SQL-like Query Language
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Globe, Sep 2015, Valencia, Spain. pp.170-185.
  11. Data Partitioning for Fast Mining of Frequent Itemsets in Massively Distributed Environments
    Saber Salah, Reza Akbarinia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. pp.303-318.
  12. An Efficient Solution for Processing Skewed MapReduce Jobs
    Reza Akbarinia, Miguel Liroz-Gistau, Divyakant Agrawal, Patrick Valduriez
    Globe, Sep 2015, Valencia, Spain. pp.417-429.
  13. A Prime Number Based Approach for Closed Frequent Itemset Mining in Big Data
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. pp.509-516.
  14. Fast Parallel Mining of Maximally Informative k-Itemsets in Big Data
    Saber Salah, Reza Akbarinia, Florent Masseglia
    ICDM: International Conference on Data Mining, Aug 2015, Atlantic City, United States. pp.359-368.
  15. When sharing computer science with everyone also helps avoiding digital prejudices
    Marie Duflot, Martin Quinson, Florent Masseglia, Didier Roy, Julien Vaubourg, Thierry Viéville
    Escape computer dirty magic: learn Scratch! SCRATCH, Aug 2015, Amsterdam, Netherlands. <http://www.scratch2015ams.org/>
  16. On Term Selection Techniques for Patent Prior Art Search
    Mona Golestan Far, Scott Sanner, Mohamed Reda Bouadjenek, Gabriela Ferraro, David Hawking
    SIGIR: Research and Development in Information Retrieval, Aug 2015, Santiago, Chile. <10.1145/2766462.2767801>
  17. Aggregation-Aware Compression of Probabilistic Streaming Time Series
    Reza Akbarinia, Florent Masseglia
    MLDM: Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. pp.232-247.
  18. Optimizing the Data-Process Relationship for Fast Mining of Frequent Itemsets in MapReduce
    Saber Salah, Reza Akbarinia, Florent Masseglia
    MLDM: Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. pp.217-231.
  19. Towards efficient data integration and knowledge management in the Agronomic domain
    Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    APIA: Applications Pratiques de l'Intelligence Artificielle, Jul 2015, Rennes, France. pp.117-124.
  20. OpenAlea: Scientific Workflows Combining Data Analysis and Simulation
    Christophe Pradal, Christian Fournier, Patrick Valduriez, Sarah Cohen-Boulakia
    SSDBM: Scientific and Statistical Database Management, Jun 2015, San Diego, United States. <10.1145/2791347.2791365>
  21. Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    ICMR: International Conference on Multimedia Retrieval, Jun 2015, Shanghai, China. pp.155-162.
  22. DigInPix: Visual Named-Entities Identification in Images and Videos
    Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval, Olivier Buisson
    ICMR: International Conference on Multimedia Retrieval, Jun 2015, Shanghai, China. pp.661-664.
  23. A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications
    Mohamed Reda Bouadjenek, Scott Sanner, Gabriela Ferraro
    ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. pp.23-32.
  24. Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    SIGMOD: International Conference on Management of Data, May 2015, Melbourne, Australia. pp.779-794.
  25. Data-intensive HPC: opportunities and challenges
    Patrick Valduriez
    BDEC: Big Data and Extreme-scale Computing, Barcelona Supercomputing Center, Jan 2015, Barcelona, Spain.

2014

  1. Recognizing Thousands of Legal Entities through Instance-based Visual Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez
    MM: Conference on Multimedia, Nov 2014, Orlando, FL, United States. pp.1029-1032.
  2. Fine-grained Visual Faceted Search
    Julien Champ, Alexis Joly, Pierre Bonnet
    MM: Conference on Multimedia, Nov 2014, Orlando, FL, United States. pp.721-722.
  3. Layer Decomposition: An Effective Structure-based Approach for Scientific Workflow Similarity
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    International Conference on e-Science, Oct 2014, Guarujá, Brazil. pp.169-176.
  4. NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues
    Vinicius P. Freire, José A. F. de Macêdo, Fábio Porto, Reza Akbarinia
    International Conference on e-Science, Oct 2014, Guarujá, SP, Brazil. pp.83-86.
  5. PlantRT : a Distributed Recommendation Tool for Citizen Science
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Alexis Joly, Julien Champ
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. pp.48-50.
  6. Compression de flux de données probabilistes attentive à l'agrégation
    Reza Akbarinia, Florent Masseglia
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. <http://bda2014.imag.fr>
  7. Exploiting Diversification in Distributed Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    BDA: Gestion de Données — Principes, Technologies et Applications, IMAG, Oct 2014, Autrans, France. <http://bda2014.imag.fr>
  8. Multisite Management of Data-intensive Scientific Workflows in the Cloud
    Ji Liu
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. pp.28-30.
  9. Pl@ntNet, une plate-forme innovante d'agrégation et partage d'observations botaniques
    Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino, Alexis Joly, Hervé Goëau, Vera Bakić, Souheil Selmi, Julien Champ, Jennifer Carre, Mathias Chouet, Aurélien Perronnet, Christelle Vignau, Samuel Dufour-Kowalski, Antoine Affouard, Julien Barbe, Pierre Bonnet
    International Conference ‘Botanists of the Twenty-first Century’, UNESCO, Sep 2014, Paris, France. pp.191-197.
  10. Instance-based Bird Species Identification with Undiscriminant Features Pruning
    Alexis Joly, Julien Champ, Olivier Buisson
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. pp.625-633.
  11. PlantNet Participation at LifeCLEF2014 Plant Identification Task
    Hervé Goëau, Alexis Joly, Itheri Yahiaoui, Vera Bakić, Anne Verroust-Blondet, Pierre Bonnet, Daniel Barthélémy, Nozha Boujemaa, Jean-François Molino
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. pp.724-737.
  12. LifeCLEF Bird Identification Task 2014
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. pp.585-597.
  13. LifeCLEF 2014: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Bob Fisher, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2014, Sheffield, United Kingdom. pp.229-249.
  14. LifeCLEF Plant Identification Task 2014
    Hervé Goëau, Alexis Joly, Pierre Bonnet, Souheil Selmi, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. pp.598-615.
  15. Exploiting Diversification in Gossip-Based Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Globe, Sep 2014, Munich, Germany. pp.25-36.
  16. Scientific Workflow Partitioning in Multi-site Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Vitor Silva Souza, Marta Mattoso
    Euro-Par: Parallel Processing Workshops, Aug 2014, Porto, Portugal. pp.105-116.
  17. LifeCLEF: Multimedia Life Species Identification
    Alexis Joly, Henning Müller, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Andreas Rauber, Pierre Bonnet, Willem-Pier Vellinga, Bob Fisher
    EMR: Environmental Multimedia Retrieval, Apr 2014, Glasgow, United Kingdom. pp.7-13.
  18. Pl@ntNet Mobile 2014: Android port and new features
    Hervé Goëau, Pierre Bonnet, Alexis Joly, Antoine Affouard, Vera Bakić, Julien Barbe, Samuel Dufour-Kowalski, Souheil Selmi, Itheri Yahiaoui, Christelle Vignau, Daniel Barthélémy, Nozha Boujemaa
    ICMR: International Conference on Multimedia Retrieval, Apr 2014, Glasgow, United Kingdom. pp.527-530.

Tags

Scientific data, Uncertain data, Data processing, Data analysis, Social-based data sharing, Recommendation, Scientific workflows, Data integration, Content-based information retrieval, P2P, Cloud

Last update on 01/07/2019