fault detection using advanced analytics at cern's large hadron … · 2018. 11. 19. · •...

24
Fault Detection using Advanced Analytics at CERN's Large Hadron Collider Antonio RomeroMarín Manuel Martin Marquez USA - 27/01/2016 BIWA 16 1

Upload: others

Post on 12-Sep-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

FaultDetectionusingAdvancedAnalyticsatCERN'sLargeHadronCollider

AntonioRomeroMarínManuelMartinMarquez

USA- 27/01/2016 BIWA16 1

Page 2: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

What’sCERN

USA- 27/01/2016 BIWA16 2

Page 3: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

What’sCERN• EuropeanLaboratoryforParticlePhysics• FundamentalResearch

• WorldwideInternationalCollaboration• USisanimportantcontributortoCERNandtheexperiments

• Education&Training• PushFrontiersofTechnology

USA- 27/01/2016 BIWA16 3

Page 4: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

CERNopenlab• Public-privatepartnershipbetweenCERNandleadingICTcompaniesandresearchinstitutes• Acceleratecutting-edgesolutionsfortheworldwideLHCcommunityandwiderscientificresearch.• Designedtocreateanddisseminateknowledge• Publicationofreportsandarticles• Workshopsorseminars• CERNopenlab StudentProgramme

USA- 27/01/2016 BIWA16 4

Page 5: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

CERNAcceleratorComplex

USA- 27/01/2016 BIWA16 5

Page 6: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

DataAnalyticsChallenges• Somefaultscannotbeavoided

• Decreasetheavailabilityforrunning physics

• Preventivemaintenanceisnotenough• Doesnottakeintoaccounttheconditionoftheequipment

• LHCupgradeswillfurtherincreaseluminosity• Computing resourcesneedswillbehigher• Datageneratedwillincreasedrastically

USA- 27/01/2016 BIWA16 6

Page 7: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

DataAnalyticsObjective

USA- 27/01/2016 BIWA16 7

Page 8: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Areasofinvestigation• Predictivemaintenanceandsystemoptimization• Dataextraction,transformationandloading(ETL)• DataVisualizationandDiscovery

USA- 27/01/2016 BIWA16 8

Page 9: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Rfordataanalytics• Languageforstatisticalcomputingandgraphics• Powerfulintegratedsuiteofsoftwarefacilities

• Datamanipulation• Calculation• Graphicaldisplay,plotting

• FreeandOpenSource• Extensible

• Over7800CRANpackagesextendingbasefunctionality• Interactiveandprogrammaticenvironment• Greatusercommunity

• Activedevelopment,frequentlyupdated

USA- 27/01/2016 BIWA16 9

Page 10: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

ParallelProcessingApproachesinR• ParallelprocessingwithR• Foreach• Snow• Rmpi• BatchExperiments package(BatchJobs)

• SparkR• PackagedinSparkfrom1.4.0

• OracleREnterprise(ORE)

USA- 27/01/2016 BIWA16 10

Page 11: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

WhyORE?- OREbenefits• Adatabase-centricenvironmentforanalyticalprocessesinR• AllowstousethedatabaseservertorunRscripts(scalability&performance)• EliminatememoryconstraintofclientRengine• Rworkingondatadirectlyinthedatabase,nodatatransfer• Integration withtheSQLlanguage• Allowsin-databasedataanalysis• Providedataparallelism

• TransparencyLayer• TransparentlyanalyzeandusedatainOracleDatabasethroughR• NoneedtoknowSQL

USA- 27/01/2016 BIWA16 11

Page 12: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

CERNDatabaseactivity

USA- 27/01/2016 BIWA16 12

October2012 December2015Max size ACCLOG 136TB ACCLOG352TB

Maxredo ACCMEAS27TB/month QPSR115TB/month

Page 13: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

OREdeployment

USA- 27/01/2016 BIWA16 13

4xXeonE7-8895v2(15coreseach)2TBRAM

4.8TBFlash+6x1.2TB10KHDD

R ENTERPRISE

RAC– RealApplicationCluster

+

-EquipmentExperts-Operators

OracleDatabase

Engine

OracleDatabase

Engine

Page 14: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

CERNControlSystems• IoT andControlSystem

• Cryogenics• Vacuum• MachineProtection• PowerConverters• QPS

• AcceleratorLoggingService• ~275GB/day• Storingmorethan50TB/year• Dataacquisition

• CERNacceleratorcomplex• Relatedsubsystems• Experiments

• Around1millionsignals

USA- 27/01/2016 BIWA16 14

Page 15: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

LargestCryogenicsInstallation• 50kI/O,11kactuators, ~5k control loops• Control:

• ~100PLCs(Siemens,Schneider)• ~40FECs(industrial PCs)

• Supervision:26 SCADAservers

USA- 27/01/2016 BIWA16 15

Instrument/Actuators TotalTemperature[1.6– 300 K]Pressure[0– 20bar]LevelFlowControl valvesOn/Off valvesManualvalvesVirtual flowmetersControllers (PID)

10361230092326333692183519163254833

Page 16: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

UseCase:AnomalyDetectiononBeamScreen

USA- 27/01/2016 BIWA16 16

the resistive wall power losses dissipated by the beam and it intercepts syn-chrotron radiation and molecules in order to increase the quality of the beamvacuum [8]. Each beam screen is cooled by conduction through two smallcooling pipes, supplied by supercritical helium coming from the CryogenicDistribution Line of the LHC (QRL) at 3 bar and 4.6 K to avoid two-phaseflow regime. One beam screen is installed on each beam pipe in a half-cell ofthe LHC corresponding to three superconducting dipoles and one supercon-ducting quadrupole for a total length of 53 m.

Beam Screen

Cooling tube

Beam Pipe

Figure 2: LHC beam screen cooled by its two small cooling tubes

The initial cryogenic library provides only 0-D pipe models or very simpli-fied 1-D models where the ideal gas law is used. Hence, to model this specificsupercritical helium flow cooling the LHC beam screen, a new model had tobe developed reusing the facilities of the Ecosimpro cryogenic library. Thiskind of approach was already used previously to simulate a heat wave prop-agation along the LHC cryogenic distribution line after a quench [9] but inthis last paper, the model was based on simplifications for very low pressurehelium flows which are not valid for supercritical helium flows.

The following section describes the model of a supercritical helium flowthough small cooling pipes which will be then embedded in a larger systemwith other components from the cryogenic library in order to simulate thecomplete beam screen cooling circuits.

4.1. Modeling of a supercritical helium flow

A one dimensional compressible flow according to the x axis has to beconsidered as the isothermal compressibility is very high for supercritical

5

Source:EN-ICE(BenjaminBradu,EnriqueBlanco)

• PIDoutput(timeseries)segmentation• Characterization+Featureextraction• Featuresbasedclassification

Page 17: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

UseCase:AnomalyDetectiononBeamScreen

USA- 27/01/2016 BIWA16 17

STABLEBEAM BEAMDUMPSETUPINJECTBEAM

Page 18: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

UseCase:FaultyCryogenicsValveDetection• Whatistheobjective?

• Predictfaultyvalvesbeforetheyactuallyfail

• How?• Valvereceiveanapertureordervalue(apertureorder)• Effectiveaperturerealizedbythevalve(aperturemeasured)• Analyzing thedifferencebetweenboth(S =apertureorder- aperturemeasured)

13/08/2015 openlabSummerStudentsLecture13:DataMining,DataAnalyticsandBigData

18

Page 19: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

UseCase:FaultyCryogenicsValveDetection• Signalsused:

• S =apertureorder- aperturemeasured• FeaturesextractionsbasedonS

• Variance• Percentile99.9• Ropedistance– R(S)• NoiseBand – B(S)

• AutomaticFaultyValvesDetectionSystem• SVM- SupportVectorMachine

• Thelearningset44valves• apertureorder(%)• aperturemeasured (%)

USA- 27/01/2016 BIWA16 19

Page 20: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Cryo Valves– ExtractFeaturesinORE

• Valvesdatasetwith6features• 10.5Mrows• 42Mrows• 168Mrows

USA- 27/01/2016 BIWA16 20

Page 21: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Ourexperience• Movethedataisveryexpensive• ItisfastertoprocessitintheDBwithOREusingtheappropriatedegreeofparallelism

• DBnodesalreadypreparedfortheworkload• Memoryisaproblemforsmallclients• Simplifiestheinfrastructure

• Write/adaptRcodeisstraightforward• ThankstotransparencylayerandembeddedRexecution

USA- 27/01/2016 BIWA16 21

Page 22: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Nextsteps• TestwithDBin-memoryfeatures• Benefitfromthehybridecosystem• GetthebestfromRDBMS+Hadoop

• Don’tforgetaboutcurrentapplicationlayerworkingwiththeDB• Deployinstreaming

• PredictiveModelMarkupLanguage(PMML)• Compatibleacrossmultiplelanguages(R,Python,Spark,Java,etc.)

USA- 27/01/2016 BIWA16 22

Page 23: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA

Conclusions• RecosystemisverygoodforDataAnalytics• OREoffersagoodsolutiontogetmorefromyourDBdeployment• Nopainadaptingcode• FollowsRstandards

• Avoidmovingdatawhenpossible• Hybridsystemslookquitepromisingtocovermultiplescenarios

USA- 27/01/2016 BIWA16 23

Page 24: Fault Detection using Advanced Analytics at CERN's Large Hadron … · 2018. 11. 19. · • Compatible across multiple languages (R, Python, Spark, Java, etc.) USA -27/01/2016 BIWA