CERN Accelerating science

IT Posters

Letzte Einträge:
2021-01-29
12:31
Exhibition - visit circuit: the history of the CERN Data Centre
Exposition - Circuit de visites : l'histoire du Centre de calcul du CERN

Reference: Poster-2021-1043
Keywords:  computing  history  Data Centre  Centre de calcul  Computing Centre
Created: 2020. -10 p
Creator(s): Gaillard, Melissa; Perrey, Melissa Loyse

These ten panels have been installed in building 49 (corridor linking building 31 to 513) as part of the visit circuit. They give an overview of the history of the CERN Data Centre between CERN's creation and 2020. Languages : French and EnglishCes 10 panneaux ont été installés dans le bâtiment 49 (couloir reliant le bâtiment 31 au bâtiment 513) dans le cadre du circuit de visites. Ils retracent brièvement l'histoire du Centre de calcul du CERN de la création du laboratoire jusqu'à 2020. Langues : anglais et français.

Related links:
Exhibition area - building 49 - 31 - 513
© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2019-10-22
15:01
Support for HTCondor High-Throughput Computing Workflows in the REANA Reusable Analysis Platform
Reference: Poster-2019-942
Keywords:  reproducible science  computational workflows  high-throughput computing  high-performance computing  data analysis
Created: 2019. -1 p
Creator(s): Maciulaitis, Rokas; Bremer, Paul; Hampton, Scott; Hildreth, Mike; Hurtado Anampa, Kenyi Paolo [...]

REANA is a reusable and reproducible data analysis platform allowing researchers to structure their analysis pipelines and run them on remote containerised compute clouds. REANA supports several different workflows systems (CWL, Serial, Yadage) and uses Kubernetes’ job execution backend. We have designed an abstract job execution component that extends the REANA platform job execution capabilities to support multiple compute backends. We have tested the abstract job execution component with HTCondor and verified the scalability of the designed solution. The results show that the REANA platform would be able to support hybrid scientific workflows where different parts of the analysis pipelines can be executed on multiple computing backends.

Related links:
eScience 2019 Poster Session
© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2018-12-22
09:57
Trident : An Automated System Tool for Collecting and Analyzing Performance Counters
Reference: Poster-2018-667
Created: 2018. -1 p
Creator(s): Muralidharan, Servesh; Smith, David

Trident, a qualitative analysis tool that can look at various low level metrics with respect to the Core, Memory and I/O to highlight performance bottlenecks during the execution of an application. Trident uses a three pronged approach in analysing a node's utilisation of hardware resources and to help a non system expert understand the stress on different parts of the system by a given job. Currently metrics such as memory bandwidth, core utilization, active processor cycles, etc., are being collected. Interpretation of the data in raw form is often non intuitive. Therefore, the tool converts these data into derived metrics that are then represented as a system wide extended Top-Down analysis that helps developers and site managers likewise understand the application behavior without the need for in-depth expertise of architecture details.

© CERN Geneva

Access to file

Details des Eintrags - Ähnliche Datensätze
2018-12-19
15:06
Search for computational workflow synergies in reproducible research data analyses in particle physics and life sciences
Reference: Poster-2018-666
Keywords:  reproducible science  data preservation  data analysis  computational workflows
Created: 2018. -1 p
Creator(s): Šimko, Tibor; Cranmer, Kyle; Crusoe, Michael R; Heinrich, Lukas; Khodak, Anton [...]

We describe the REANA reusable and reproducible research data analysis platform that originated in the domain of particle physics. We integrated support for running Common Workflow Language (CWL) workflows that originated in the domain of life sciences. This integration allowed us to study the applicability of CWL to particle physics analyses and look for synergies in computational practices in the two communities.

© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2018-11-15
09:46
Increasing Windows security by hardening PC configurations
Reference: Poster-2018-664
Keywords:  CHEP  Security  Hardened PC  Windows  IT-CDA-AD
Created: 2018. -1 p
Creator(s): Martin Zamora, Pablo; Kwiatek, Michal; Bippus, Vincent Nicolas; Cruz Elejalde, Eneko

Over 8000 Windows PCs are actively used on the CERN site for tasks ranging from controlling the accelerator facilities to processing invoices. PCs are managed through CERN's Computer Management Framework and Group Policies, with configurations deployed based on machine sets and a lot of autonomy left to the end-users. While the generic central configuration works well for the majority of the users, a specific hardened PC configuration is now provided for users who require stronger resilience against external attacks.

Related links:
CHEP 2018
© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2018-08-22
18:01
CDS Videos - The new platform for CERN videos
Reference: Poster-2018-654
Created: 2018. -1 p
Creator(s): Marian, Ludmila; Gabancho, Esteban; Gonzalez Lopez, Jose Benito; Tarocco, Nicola; Costa, Flavio [...]

CERN Document Server (CDS, cds.cern.ch) is the CERN Institutional Repository based on the Invenio open source digital repository framework. It is a heterogeneous repository, containing more than 2 million records, including research publications, audiovisual material, images, and the CERN archives. Its mission is to store and preserve all the content produced at CERN as well as to make it easily available to any outlet interested. CDS aims to be the CERN’s document hub. To achieve this we are transforming CDS into an aggregator over specialized repositories, each having its own software stack, with features enabled based on the repository’s content. The aim is to enable each content producer community to have its own identity, both visually and functionally, as well as increased control on the data model and the submission, curation, management, and dissemination of the data. This separation is made possible by using the Invenio 3 framework. The first specialized repository created is CDS Videos (videos.cern.ch). It has been launched in December 2017, and is the first step in the long-term project to migrate the entire CDS to the Invenio 3 framework. CDS Videos provides an integrated submission, long-term archival and dissemination of CERN video material. It offers a complete solution for the CERN video team, as well as for any department or user at CERN, to upload video productions. The CDS Videos system will ingest the video material, interact with the transcoding server for generating web and broadcaster subformats, mint DOI persistent identifiers, generate embeddable code to be reused by any other website, and store the master files for long-term archival. The talk will detail the software architecture of the CDS Videos as well as the infrastructure needed to run such a large-scale web application. It will present the technical solutions adopted, including the Python-based software stack (using among others Flask, IIIF, ElasticSearch, Celery, RabbitMQ) and the new AngularJS-based user interface which was exclusively designed for CDS Videos. It will also present our solution to a lossless migration of data: more than 5'000 videos from 1954 to 2017, summing up to 30TB of files, have been migrated from DFS to EOS in order to populate the CDS Videos platform. All this could be of high interest to other institutes wanting to reuse the CDS Videos open source code for creating their own video platform. Last but not least, the talk will detail how the user community at CERN and beyond can take advantage of the CDS Videos platform for creating and disseminating video content.

© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2018-08-09
14:07
Honeypot Resurrection - Redesign of CERN's Security Honeypots
Reference: Poster-2018-653
Keywords:  computer security, honeypot, SOC
Created: 2018. -1 p
Creator(s): Buschendorf, Fabiola

Honeypots are a fake system residing in a companie's or organization's network, attracting attackers by emulating old and vulnerable software. If a Honeypot is accessed, all actions are logged and any submitted files are being stored on the host machine. The current Honeypot at CERN is deprecated and does not provide useful notifications. The task of this summer student project is to identify well maintained and up-to-date open source honeypots, test and configure them and finally deploy them to convincingly resemble a CERN host in order to collect information about potentially malicious activity inside the GPN.

© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2017-07-24
10:10
Publication Life Cycle at CERN Document Server
Reference: Poster-2017-593
Keywords:  Open Repositories  Invenio  CDS
Created: 2017. -1 p
Creator(s): Witowski, Sebastian; Gonzalez Lopez, Jose Benito; Costa, Flavio; Gabancho, Esteban; Marian, Ludmila [...]

This presentation guides listeners through all the stages of publication life cycle at CERN Document Server, from the ingestion using one of the various tools, through curation and processing, until the data is ready to be exported to other systems. It describes different tools that we are using to curate the incoming publications as well as to further improve the existing data on CDS. The second part of the talk goes through various challenges we have faced in the past and how we are going to overcome them in the new version of CDS.

Related links:
Open Repositories
© CERN Geneva

Access to file

Details des Eintrags - Ähnliche Datensätze
2017-07-17
17:30
Python at CERN
Reference: Poster-2017-592
Keywords:  Python  CERN  PyROOT  SWAN  Invenio  Indico
Created: 2017. -1 p
Creator(s): Witowski, Sebastian

The Large Hadron Collider at CERN is producing 600 million collisions every second. Only 1 in a million collisions is interesting. It requires a fast programming language to analyze and filter this amount of data. Is Python such a language? No, it’s not. Does it mean there is no place for Python in one of the largest scientific facilities in the world? Quite the contrary. The ease of use and a very low learning curve makes Python a perfect programming language for many physicists and other people without the computer science background. CERN does not only produce large amounts of data. The interesting bits of data have to be stored, analyzed, shared and published. Work of many scientists across various research facilities around the world has to be synchronized. This is the area where Python flourishes. And with CERN’s pursuit to create and use open source software, many interesting projects were born. To facilitate the analysis of data, ROOT framework [https://root.cern.ch/] was created. It’s a C++ framework focused on big data processing, statistical analysis, visualization and storage. It has been around for more than 20 years, but since nowadays more and more scientists have at least basic Python knowledge, the PyROOT project [https://root.cern.ch/pyroot] was born. PyROOT is a Python extension module that allows users to interact with ROOT from Python interpreter. It combines the ease of use of Python with the powerful capabilities of the ROOT framework. All the discoveries, small and big ones, results in thousands of publications that has to go through the whole publication workflow. For that purpose, a digital library framework called Invenio was created [http://invenio-software.org/]. It can be used to easily build your own fully customized digital library, institutional repository, multimedia archive, or research data repository on the web. Some examples of websites build with Invenio are: https://zenodo.org/, https://cds.cern.ch/ or https://analysispreservation.cern.ch/. Another of CERN’s missions is to share the knowledge, and that can be done through various lectures, workshops and conferences. All those events can easily be organized with the help of Indico [http://indico-software.org/]. Indico comes also with a room booking module and can be easily integrated with various collaborative tools.

Related links:
EuroPython 2017
© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze
2017-07-05
17:28
Using Invenio for managing and running open data repositories
Reference: Poster-2017-590
Created: 2017. -1 p
Creator(s): Simko, Tibor; Kuncar, Jiri; Nielsen, Lars Holm

We present how a research data repository manager can build custom open data solutions to ingest, describe, preserve, and disseminate the open research environments, datasets and software using the Invenio digital library framework. We discuss a concrete use case example of the CERN Open Data and Zenodo services, describing technological challenges in preparing large sets of data for general public. We address the questions of efficient linking and sharing of large quantities of data without unnecessary duplication on the backend, the role of the file transfer protocols, as well as the means to visualise data to make it more accessible and interactive for general public. The technological challenges and discussed solutions can be applied to any research discipline outside the domain of particle physics.

© CERN Geneva

Access to files

Details des Eintrags - Ähnliche Datensätze