---------------------------------------------------------------------------------------------------------------
EGI BROADCAST TOOL : https://operations-portal.egi.eu/broadcast/send
---------------------------------------------------------------------------------------------------------------
Publication from : Antonio Santiago Cofino Gonzalez <antonio.cofino(a)unican.es>
----------------------------------------------------------------------------------------------------------------
Dear All,
I would like to announce that the UNICAN site has started the decommissioning procedure.
Timeline
====
4th of April 2017
- Broadcast of timeline to VO managers and users.
- All of the site's grid services will begin a scheduled downtime lasting until
19th of April, during which time VO managers and users can retrieve
data.
19th of April
- Start of scheduled Downtime until 4th of June
4th of June 2017
- *** HARD DEADLINE for VO Managers to retrieve all data from storage
elements. ***
- The resource centre status will be marked "suspended".
- Resource centre hardware and services may become inaccessible
without further notice to VO managers, users, etc. from this date.
5th of August 2017
- End of log retention period.
- Resource centre will be marked "closed".
Best regards
Antonio S. Cofiño
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1682
----------------------------------------------------------------------------------------------------------------
Publication from : Cyril Lorphelin <cic-information(a)cc.in2p3.fr>
----------------------------------------------------------------------------------------------------------------
Dear Users,
The VAPOR application v2.2 is now online : https://operations-portal.egi.eu/vapor
Here is the list of the main changes
1) Map : http://operations-portal.egi.eu/vapor/resources/GL2Map
- you can filter the sites by VO: only sites supporting the selected VO will be visible
- you can apply a text filter under the map. This filter is applied to both the table and the map.
2) Faulty resources : http://operations-portal.egi.eu/vapor/resources/GL2ResFaulty
- the reason why a resource is listed is now visible
- faulty publications are highlighted when possible
- faulty values are now visible for jobs and benchmarks
3) Figures
- some details have been added
- CPU and Storage have been split for a better understanding
4) A complete API is available : http://operations-portal.egi.eu/vapor/downloadLavoisierInfo
The computation of the CPU and storage values has been thoroughly reviewed.
Nevertheless, some values are still not in line with reality.
The next version will focus on these computations in order to provide better figures.
For more details : http://operations-portal.egi.eu/vapor/releases?name=VAPOR+2.2
Don't hesitate to contact us with comments, feedback or bug reports at cic-information(a)in2p3.fr
Cheers,
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1659
----------------------------------------------------------------------------------------------------------------
Publication from : adrian coveney <apel-admins(a)stfc.ac.uk>
----------------------------------------------------------------------------------------------------------------
The APEL Accounting Repository is being updated today, so there may be a delay in accounting data reaching the Portal. This will mostly affect cloud accounting which requires extra processing during the update.
The APEL Team.
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1638
----------------------------------------------------------------------------------------------------------------
Publication from : Alessandro Paolini <alessandro.paolini(a)egi.eu>
----------------------------------------------------------------------------------------------------------------
=============== Contents: ==================
1) Decommissioning dCache 2.10
2) CMD-OS 1.0.0 for OpenStack released
3) UMD 3.14.7 and UMD 4.3.2
4) Recommendation for reporting security incidents
5) GSTAT decommissioned
==========================================
1) Decommissioning dCache 2.10
Support for dCache 2.10 ended in December 2016. As a consequence, according to EGI policies, dCache 2.10 must be decommissioned. All sites are invited to plan an upgrade of their 2.10 endpoints to a newer golden release, either 2.13 (whose support ends in July 2017) or 2.16 (whose support ends in May 2018).
A decommissioning campaign will be started in the next days by EGI Operations to monitor the upgrade of the dCache 2.10 instances and follow up with the NGIs/sites.
Please consider that 2.13 goes out of support in only 6 months, and that the dCache team does not support upgrading from 2.10 directly to 2.16.
More information here: https://www.dcache.org/downloads/1.9/index.shtml
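As a rough illustration of the upgrade constraint described in item 1, the Python sketch below lists the hops needed to move between golden releases. The names GOLDEN_RELEASES, NO_DIRECT_UPGRADE and upgrade_path are hypothetical, and the idea that 2.13 is the intermediate step between 2.10 and 2.16 is an assumption drawn from the wording of this announcement, not from the dCache documentation.

```python
# Golden releases mentioned in this broadcast, oldest first.
GOLDEN_RELEASES = ["2.10", "2.13", "2.16"]

# Direct jumps that the broadcast says are not supported.
NO_DIRECT_UPGRADE = {("2.10", "2.16")}


def upgrade_path(current, target):
    """Return the list of releases to pass through, starting at `current`."""
    i = GOLDEN_RELEASES.index(current)
    j = GOLDEN_RELEASES.index(target)
    if j <= i:
        raise ValueError("target must be newer than the current release")
    if (current, target) in NO_DIRECT_UPGRADE:
        # Assumption: step through every golden release in between.
        return GOLDEN_RELEASES[i:j + 1]
    return [current, target]


if __name__ == "__main__":
    print(" -> ".join(upgrade_path("2.10", "2.16")))
```

Please check the dCache release notes for the actual supported upgrade routes before planning an intervention.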
2) CMD-OS 1.0.0 for OpenStack released
The very first version of CMD-OS, the Cloud Middleware Distribution for OpenStack Mitaka, has been released. It includes Keystone-VOMS 9.0.3, ooi 0.3.2, gridsite 2.3.3, Cloud BDII Information provider 0.6.12. For more details, please visit: http://repository.egi.eu/category/os-distribution/cmd-os-1/
3) UMD 3.14.7 and UMD 4.3.2 were released on December 5th (fix releases), with an update for umd-release, fixing an issue with GPG keys.
4) Recommendation for reporting security incidents
In accordance with the EGI CSIRT Security Incident Handling Procedure (https://wiki.egi.eu/wiki/SEC01), please report any security incident to your local security team, your NGI Security Officer and the EGI CSIRT via abuse(a)egi.eu
5) GSTAT decommissioned
As announced in the past months, GSTAT has been decommissioned and replaced by VAPOR: https://operations-portal.egi.eu/vapor/
We would like to warmly thank ROC_AsiaPacific for having provided and managed such a powerful tool over several years and projects.
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1631
----------------------------------------------------------------------------------------------------------------
Publication from : stephen jones <sjones(a)hep.ph.liv.ac.uk>
----------------------------------------------------------------------------------------------------------------
The ticket (see below) gives details. The CE will be in downtime from 00:00 on 14 Feb 2017. It will be removed after two weeks.
https://ggus.eu/?mode=ticket_info&ticket_id=126167
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1622
----------------------------------------------------------------------------------------------------------------
Publication from : francesco fabozzi <francesco.fabozzi(a)na.infn.it>
----------------------------------------------------------------------------------------------------------------
Dear VO Managers,
the site INFN-NAPOLI-CMS is being decommissioned.
The site will be put in downtime to prevent new activities.
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1599
----------------------------------------------------------------------------------------------------------------
Publication from : Alessandro Paolini <alessandro.paolini(a)egi.eu>
----------------------------------------------------------------------------------------------------------------
======= Content ========
1) UMD releases
2) Decommission of mon.egi.eu and cloudmon.egi.eu
======================
1) On Nov 23rd two revisions of UMD 3.14.6 (SL6) and UMD 4.3.1 (SL6/CentOS7) have been released:
a) UMD 3.14.6 includes lcmaps-plugins-vo-ca-ap, needed for supporting the IGTF IOTA profile of CAs
b) UMD 4.3.1 includes:
*** CentOS7
lcas-lcmaps-gt4-interface 0.3.0-0.3.1
lcmaps 1.6.6
lcas 1.3.19
glExec 1.2.3
glExec-WN 1.3.0
lcmaps-plugins 1.7.1
*** SL6
ARGUS 1.7 (regular 4.3.0 shipped only CentOS7 version)
2) Decommission of mon.egi.eu and cloudmon.egi.eu
a) On 29 November all the cloud probes were moved to the central servers, and cloudmon.egi.eu was decommissioned on Dec 1st.
All the probes are executed using the following certificate subjects:
/DC=EU/DC=EGI/C=HR/O=Robots/O=SRCE/CN=Robot:argo-egi@cro-ngi.hr
/DC=EU/DC=EGI/C=GR/O=Robots/O=Greek Research and Technology Network/CN=Robot:argo-egi@grnet.gr
b) On Dec 6th 2016 the old SAM GridMon box mon.egi.eu, housing central ATP and POEM, was decommissioned. These services became obsolete when we switched to central monitoring instances in July.
- The VO SAM instances will not be affected as they are using local ATP and POEM.
- Remaining NGI SAM instances rely on central ATP and will no longer get topology updates, so this gives their administrators extra incentive to decommission them.
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1597
----------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------
EGI BROADCAST TOOL : https://operations-portal.in2p3.fr/broadcast/send
---------------------------------------------------------------------------------------------------------------
Publication from : Frederic Schaer <frederic.schaer(a)cea.fr>
----------------------------------------------------------------------------------------------------------------
Dear VOs and users,
It was found by the CMS experiment that a WN at the GRIF/IRFU site was silently corrupting files (thanks, CMS).
After investigation, it appears that a CPU on the machine was silently corrupting files while they were being compressed on the machine, but only when the compression task was running on core #8 of CPU socket #0, as well as on its sibling hyperthreaded core #28.
Unfortunately, this hardware issue went unnoticed because it was not caught by the various hardware and software system checks: neither the Dell nor the Intel diagnostic tools could find and report it.
Unfortunately, ROOT files also seem to be affected, or at least files created by the CMS software, which includes ROOT and recompiled copies of various compression tools.
It was also found that files compressed with the "bzip2" system tool were corrupted, but not files created with the system lzma or gzip tools, for instance.
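For admins who want to look for a similar per-core problem on their own worker nodes, here is a minimal Python sketch of a round-trip check. It pins the process to one core at a time, compresses and decompresses a test payload with bz2, and compares checksums. This is not the procedure we used for our investigation; the function names are hypothetical, os.sched_setaffinity is Linux-only, and we assume that Python's bz2 module exercises the same kind of code as the system bzip2 tool. A clean result does not prove that a core is healthy.

```python
import bz2
import hashlib
import os


def roundtrip_ok_on_core(core, payload, rounds=3):
    """Pin this process to `core` and check bz2 compress/decompress round trips.

    Returns False on the first checksum mismatch or decompression error.
    """
    expected = hashlib.sha256(payload).hexdigest()
    previous = os.sched_getaffinity(0)      # remember the current CPU set
    os.sched_setaffinity(0, {core})         # run only on the core under test
    try:
        for _ in range(rounds):
            try:
                restored = bz2.decompress(bz2.compress(payload))
            except (OSError, ValueError):
                # A corrupted stream usually fails bz2's own integrity check.
                return False
            if hashlib.sha256(restored).hexdigest() != expected:
                return False
        return True
    finally:
        os.sched_setaffinity(0, previous)   # restore the original CPU set


def scan_cores(payload):
    """Run the round-trip check on every core this process is allowed to use."""
    return {core: roundtrip_ok_on_core(core, payload)
            for core in sorted(os.sched_getaffinity(0))}


if __name__ == "__main__":
    data = b"EGI broadcast test block\n" * 4096
    for core, ok in scan_cores(data).items():
        print(f"core {core}: {'ok' if ok else 'CORRUPTION DETECTED'}")
```

Since the fault here was intermittent enough to escape vendor diagnostics, a real check would probably need much larger payloads and many more rounds than this sketch uses.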
Final bad news: we have no way to identify which files (your files) were produced on that machine.
We would therefore like to warn you about this problem, giving you as many details as possible.
The machine name is : wn328.datagrid.cea.fr
The ethernet MAC address of the main network interface is : 00:8C:FA:F2:93:1E
The host IPs are : 192.54.205.14 (v4) and 2001:660:3031:110:10::328/64 (v6)
The host entered production on Sep. 21 @ 9H49.
The host is running an up-to-date SL 6.8
Of course, the host was finally taken out of production (thanks again to CMS ;) ) on November 25 2016 @ 10H01 CET, and the bad CPU should be replaced this week.
We would like to apologize for this unwelcome hardware failure, as we already know that finding the affected files will be hard work that you would all have preferred to avoid.
Best regards
The GRIF/IRFU admins
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.in2p3.fr/broadcast/archive/1591
----------------------------------------------------------------------------------------------------------------
Publication from : Alessandro Paolini <alessandro.paolini(a)egi.eu>
----------------------------------------------------------------------------------------------------------------
1) Availability/Reliability (A/R) recomputation to exclude the downtimes due to the vulnerability CVE-2016-5195
All the resource centres that were affected by the vulnerability CVE-2016-5195 and that declared a downtime between 2016-10-20 16:00 UTC and 2016-10-31 18:00 UTC are invited to request a recomputation of A/R figures for the days in which the downtime was ongoing.
In accordance with the procedure https://wiki.egi.eu/wiki/PROC10_Recomputation_of_SAM_results_or_availabilit… you need to fill in this form:
http://argo.egi.eu/lavoisier/recomputation
and indicate:
- Your name and email
- the site(s) affected by the problem
- a description of the problem
- the profile affected
- the starting and ending time of the problem (including day and hour in UTC)
In case of problems with the web form, please submit a GGUS ticket to ARGO/SAM support unit providing the same information.
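To help work out the start and end times to report, the following Python sketch computes which part of a declared downtime falls inside the window given above. It is only an illustration: the function name is hypothetical, and treating any overlap with the window as eligible is an assumption, since the final decision rests with the recomputation procedure.

```python
from datetime import datetime, timezone

# Window stated in this broadcast (UTC).
WINDOW_START = datetime(2016, 10, 20, 16, 0, tzinfo=timezone.utc)
WINDOW_END = datetime(2016, 10, 31, 18, 0, tzinfo=timezone.utc)


def overlap_with_window(start, end):
    """Return the (start, end) part of a downtime inside the window, or None."""
    if start.tzinfo is None or end.tzinfo is None:
        raise ValueError("downtime times must be timezone-aware (UTC)")
    if end <= start:
        raise ValueError("downtime must end after it starts")
    lo = max(start, WINDOW_START)
    hi = min(end, WINDOW_END)
    return (lo, hi) if lo < hi else None


if __name__ == "__main__":
    # Example downtime, invented for illustration only.
    downtime = (datetime(2016, 10, 25, 8, 0, tzinfo=timezone.utc),
                datetime(2016, 11, 2, 12, 0, tzinfo=timezone.utc))
    print(overlap_with_window(*downtime))
```

Times are kept timezone-aware so that a downtime entered in local time is not silently compared against the UTC window.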
2) UMD 3.14.5 released today, including:
- umd-release 3.14.3, fixing an issue with GPG keys; details here: https://gist.github.com/pkoro/cc2ce75a0867a835f15d2f4d3fe50f44 (it doesn't affect new installations)
- gridsite 2.3.3, fixing an issue with proxy renewal on WMS https://ggus.eu/index.php?mode=ticket_info&ticket_id=124499
- VOMS 3.5.0, which makes RFC proxies the default for voms-proxy-init; an update of YAIM core handling RFC proxy as the new default
3) please start using UMD4/SL6 or UMD4/CentOS7 instead of UMD3/SL6
- Debian not used anymore, SL5 only security fixes, SL6 is available in UMD4 as well
- UMD4/SL6 contains the UMD3/SL6 products that will be supported for at least the next year; unsupported products are not included in UMD4/SL6 (please let us know if we have skipped specific products that you need!)
-- for some unsupported products, we are investigating how to replace them with equivalent products in UMD4/SL6 (see WMS)
-- a list of all the products that are in UMD3 but have not been migrated to UMD4 is available (still to be improved): https://wiki.egi.eu/wiki/UMD3_UMD4_products
4) A new version of VAPOR (https://operations-portal.egi.eu/vapor/) will be released this month. VAPOR is an important tool for gathering and displaying the information published by the sites in the BDII, such as the computing/storage capacities and many other things, and it has replaced GSTAT.
Each NGI and Resource Centre should review the information provided by their sites and let us know of any inconsistencies: http://operations-portal.egi.eu/vapor/resources/GL2ResSummary . We need your feedback to improve the service.
- please test version 2.2 on the dev instance http://operations-portal.egi.eu/vapor_dev
- report any comments, inconsistencies or suggestions for improvement in https://ggus.eu/index.php?mode=ticket_info&ticket_id=124872 where you can find details about how to test version 2.2
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1567
----------------------------------------------------------------------------------------------------------------
Publication from : Peter Solagna <peter.solagna(a)egi.eu>
----------------------------------------------------------------------------------------------------------------
Dear EGI user,
starting in the next three days, EGI resource centres will have to disable some functionalities in the HTC (grid) job execution environment, in order to mitigate a critical kernel vulnerability for which a patch is not yet available for Red Hat based operating systems, and to keep the EGI services secure. We expect that only a small fraction of jobs will be affected.
The CE and SE interfaces (for example job submission) will work normally but, depending on the system libraries used by the submitted computing task, some jobs may fail to execute.
Sites running a Red Hat based operating system, most of the EGI resource centres, will be affected.
In the coming days we expect that a security patch will be made available for the various Red Hat flavours used in our infrastructure. As soon as the sites have upgraded, the resource centres will be fully functional again.
More information will be circulated in the coming two days.
Apologies for any inconvenience and thanks for your patience.
Regards
Peter Solagna on behalf of the EGI Operations
----------------------------------------------------------------------------------------------------------------
link to this broadcast : https://operations-portal.egi.eu/broadcast/archive/1557
----------------------------------------------------------------------------------------------------------------