Question

Hi,

I'm having trouble retrieving api/monitor/alerts using Prometheus.

I'm using Prometheus 3.2.1 with IRIS 2022.1. The metrics API is working fine, but with the alerts I'm receiving the following error:

And this is the response to the request:

#API #Monitoring #REST API #InterSystems IRIS

0 3

0 67

Question

Anderson Negreli · Oct 31, 2024

Why does RAM usage differ between /api/monitor/metrics (Prometheus) and Zabbix?

I built a monitoring system in Grafana using the IRIS API /api/monitor/metrics (reading with Prometheus) but I noticed that the RAM usage shown was below that shown by the operating system.
I installed the Zabbix agent and its usage values were higher: the line showed the same highs and lows, but shifted.
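To compare the two sources sample by sample, it can help to pull the raw numbers out of the /api/monitor/metrics response before Grafana touches them. The following is a minimal sketch, assuming the endpoint returns Prometheus text-format lines without timestamps; the sample text and its values are illustrative, not real output, and the metric names in it should be treated as assumptions to check against your own instance.

```python
def parse_metrics(text):
    """Parse Prometheus text-format samples into {name_with_labels: value}.

    Assumes each sample line is '<name or name{labels}> <value>' with no
    trailing timestamp; comment lines (# HELP, # TYPE) are skipped.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # The value is the last space-separated token on the line.
        name, _, value = line.rpartition(" ")
        samples[name] = float(value)
    return samples


# Illustrative sample only; names and values are assumptions.
SAMPLE = """\
# HELP iris_phys_mem_percent_used illustrative sample
iris_phys_mem_percent_used 41
iris_cpu_usage 7
"""

metrics = parse_metrics(SAMPLE)
print(metrics["iris_phys_mem_percent_used"])
```

With both readings in hand (this one and the Zabbix value for the same timestamp), the constant offset between the two lines can be computed directly rather than read off a chart.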

#Monitoring #InterSystems IRIS

0 2

0 110

Question

Scott Roth · Oct 2, 2019

Trying to understand Orphaned Messages

We are constantly running into issues where there are billions of Orphaned messages in our system that cause problems, and we have to manually run a cleanup to fix performance issues.

In the following article about orphaned messages... https://community.intersystems.com/post/ensemble-orphaned-messages it mentions either programmatically eliminating the Orphaned messages or using a Utility like Demo.Util.CleanupSet in ENSDEMO.

#Monitoring #System Administration #Ensemble

0 7

2 875

Question

Fahima Ansari · Mar 19, 2024

Logging/Monitoring

In the Business Operation, how can we find out which source is sending the current primary request when multiple requests arrive at the same time?

#Business Operation #Monitoring #ObjectScript #System Administration #InterSystems IRIS for Health

0 6

0 158

Question

Cyril Grosjean · Nov 15, 2023

Testing MIB file with a Rest API

Hello,

In response to the infrastructure needs of our company's service, I've created a small API that sends SNMP queries to InterSystems to visualize relevant data for retrieval when the infrastructure implements monitoring.

However, I'm experiencing a timeout issue when attempting to collect information using an SNMP walk. Here is the code for my API's SNMP service:

#API #Docker #Monitoring #InterSystems IRIS for Health #Other

0 1

0 324

Question

João Carlos Azevedo · Oct 23, 2023

List of globals referenced by a process, similar to $zreference

I need to develop a tool that identifies which data a given process consumes, so that I can collect all the data needed to build an automated test scenario.

For example, a user process will pull data from ^GLOBAL(1)="dataString", ^GLOBAL(2)="dataString2", ^GLOBAL1(1)="data1String", ^GLOBAL2(4)="data2String4". Among all the other data in these globals, I want to ignore everything the user process did not use and capture only the specific keys it did use.

#Globals #Monitoring #System Administration #Caché

1 12

0 449

Question

Sylvain Guilbaud · Aug 30, 2023

How to activate all AUDIT system events?

After installing IRIS, not all of the system AUDIT events are enabled.

What is the fastest way to activate all events?

System > Security Management > System Audit Events

#Monitoring #Security #System Administration #InterSystems IRIS #InterSystems IRIS for Health

3 1

2 243

Question

Fabio Care · May 26, 2023

What is the %SYS.Monitor.Control.1 process doing with journal files?

In the Windows Resource Manager I can observe multiple parallel processes coming from cache.exe with read operations on journal files.

All except one of these processes have the same read rate (bytes/s). The processes point to different journal files and constantly read between 200 and 3000 bytes/s.

The corresponding process via PID in the management portal of Caché shows the process %SYS.Monitor.Control.1. In 3 days of uptime on the server it has run 181.632.583 commands and modified 32.140.642 globals.

#Journaling #Monitoring #Caché

0 2

0 231

Question

John Klahn · May 22, 2023

Where to build and configure existing "Monitor Home" web based production alert dashboard

My employer set up a web-based HL7 interface monitor dashboard that displays all Ensemble components (Service/Process/Operation) in a Production, their status, and the support information embedded in each interface's listing on the Monitor. Please see 3 screenshots.

This is part of the URL that we go to when accessing this Web based Monitor: ......57772/csp/healthshare/monitor/Rush.Monitor.Web.Home.cls

#Monitoring #System Alerting and Monitoring (SAM) #Ensemble #InterSystems IRIS

0 3

0 239

Question

Jeffrey Drumm · May 5, 2023

Ens.Util.Statistics: LastActivity Value for Business Host

I'm using the EnumerateJobStatus query of class Ens.Util.Statistics to obtain the LastActivity value of a Business Host.

I would expect that this would return the timestamp of the last message received by the BH, understanding that any connect/disconnect activity would reset that timer. However, the time returned appears to actually be the time at which Ens.MonitorService generated the alert and is not directly related to anything that happened in the BH itself.

#Interoperability #Monitoring #Ensemble #InterSystems IRIS

0 1

0 186

Question

Rob Schoenmakers · Dec 21, 2022

In what .log file are my alerts saved?

Hello everybody,

In the documentation I read the following:

Alerts are messages generated by production components. InterSystems IRIS automatically writes the alerts to a log file and sends them to the production component named Ens.Alert. If your production does not have a component named Ens.Alert, then InterSystems IRIS writes alerts to the log file but does not send them to any component. The component named Ens.Alert can be of any class. The most frequently used classes for Ens.Alert are:

#Error Handling #Monitoring #InterSystems IRIS

0 3

1 311

Question

Jens Cheung · Aug 10, 2022

API Calling from Ensemble?

I'm trying to develop monitoring API for the following requirements:

#API #Monitoring #System Alerting and Monitoring (SAM) #Ensemble

1 2

1 421

Question

Mark OReilly · May 13, 2022

Creating a custom monitoring page

Hi:

Currently we are using an older Healthshare instance but I am not opposed to using IRIS as we will upgrade eventually.

Currently, for monitoring productions, we have a Monitor screen. We have both the Queues page and a DeepSee dashboard that shows the current status of our services. The issues with the DeepSee traffic-light method we currently use are: 1) the page is a bit slow to load the metrics; 2) for any new service from the team, a new widget needs to be created, and although this is easy enough to do, it is time-consuming.

#Angular2 #Monitoring #Caché

0 5

0 441

Question

Yuri Marx · May 13, 2022

Do you have cases using Datadog to monitor InterSystems products?

#Monitoring #InterSystems IRIS

0 1

0 463

Question

Michael Jobe · Jan 26, 2021

Bug in SAM Prometheus metrics endpoint

The current version of SAM creates Prometheus metric endpoints which appear to be handled correctly by the current Prometheus scraper; however, the metrics do not conform to the current Prometheus standard. The standard states:

#Monitoring #InterSystems IRIS

0 9

0 439

Question

Lucas Galdino · Jan 18, 2022

Using the ^MONMGR Utility

Hi everyone,

I'm trying to configure the Caché Monitor Manager (^MONMGR) utility to send alert e-mails.
Following the steps, I have doubts about how to configure the options in "Set Server" to send e-mails via Hotmail or Outlook (smtp-mail.outlook.com).
I don't know how to configure the mail server SSLConfiguration for Hotmail or Outlook.
Could you help me?
Thank you!

#Beginner #Monitoring #Terminal #Tools #Caché #InterSystems IRIS

0 2

0 471

Question

Edward Jalbert · Jan 21, 2022

Interacting with IRIS commands in a Linux script

We are running HealthShare on Linux Redhat via Azure.

A couple of days ago, the Azure server rebooted, which we were unaware of.

As a result, the instance was left in a down status.

In the short term, I put together a quick script to check the status and, if the instance is down, restart it.

However, before I go down that road, I thought it would be best to inquire if there is a much better and more streamlined solution?

In a nutshell, I just want to check whether the instance is up and, if it is in a state such as down or hung, start it.
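The check-then-start logic described above can be sketched as follows. This is a rough sketch, not a recommended solution: it assumes `iris qlist <instance>` prints one caret-separated line whose fourth field begins with the status (for example `running, since ...` or `down, last used ...`), and that field layout should be verified on your own system. The instance name passed in is up to the caller. A hung instance is not detected by this check.

```python
import subprocess


def instance_status(qlist_line):
    """Return the leading status word from one qlist line.

    Assumed layout (verify locally): name^install dir^version^status^...
    """
    fields = qlist_line.strip().split("^")
    status = fields[3] if len(fields) > 3 else ""
    # Keep only the part before the first comma, e.g. 'down'.
    return status.split(",")[0].strip().lower()


def needs_start(qlist_line):
    """True when the status field reports the instance as down."""
    return instance_status(qlist_line).startswith("down")


def check_and_start(instance):
    """Query the instance with 'iris qlist' and start it if it is down."""
    out = subprocess.run(
        ["iris", "qlist", instance],
        capture_output=True, text=True, check=True,
    ).stdout
    if needs_start(out):
        subprocess.run(["iris", "start", instance], check=True)
```

Something like `check_and_start("MYINSTANCE")` (a hypothetical instance name) could then be run from cron; starting the instance automatically at boot through the operating system's service manager may remove the need for polling altogether.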

#Monitoring #HealthShare

1 2

0 509

Question

David Foard · Dec 13, 2021

API Monitor Metrics - KPI recommendations?

Are there any recommended KPIs we should be using to monitor our IRIS configurations in Azure?

#Monitoring #InterSystems IRIS for Health

0 3

0 393

Question

Michael Jobe · Jan 26, 2021

SAM Add New Cluster Failed.

I installed a community version of InterSystems IRIS on a large AWS EC2 instance to do some testing. I installed SAM, and when I try to "Add a new cluster" I receive the following: "ERROR #5005: Cannot open file '/config/prometheus/isc_tmp_yml_file.yml'"

Screen shot included here: https://share.getcloudapp.com/RBu4K5yv

#AWS #Monitoring #System Alerting and Monitoring (SAM) #InterSystems IRIS

0 6

0 397

Question

Martin Staudigel · Dec 2, 2021

Enterprise Monitor - Dashboard requires Login after upgrade

Hello everybody,

After updating from 2018.2.1 to 2021.1, we observe a change in the behaviour of the Message Bank Enterprise Monitor.

In 2018.2.1, when clicking on a specific line inside the configured systems the system dashboard opened, giving insights about queue counts and error conditions.

#Monitoring #InterSystems IRIS

0 5

0 349

Question

Murray Oldfield · Mar 19, 2021

SAM - Hacks and Tips for set up and adding metrics from non-IRIS targets

#DevOps #Monitoring #System Alerting and Monitoring (SAM) #InterSystems IRIS

2 5

0 735

Question

Sergey Pavlov · Sep 3, 2021

SNMP fails with "Reason: (noSuchName) There is no such variable name in this MIB."

UPDATE:
It turns out it was just me being a dummy, and the snmpd was correctly telling me there is no value associated with that exact key. I should have used snmpwalk instead of snmpget to display the whole tree.

Original Post follows:

Hello!
I'm trying to set up SNMP monitoring on Caché, using documentation and this article
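The difference between snmpget and snmpwalk mentioned in the update can be illustrated with a toy model. This is only a sketch of the lookup semantics, not an SNMP client: the dictionary stands in for an agent, and the two leaf OIDs and their values are hypothetical entries placed under the Caché subtree 1.3.6.1.4.1.16563.1.

```python
# Toy agent data: only leaf instances hold values.
# Both OIDs and values below are hypothetical.
MIB = {
    "1.3.6.1.4.1.16563.1.1.1.1.2.7": "ENSEMBLE",
    "1.3.6.1.4.1.16563.1.1.1.1.3.7": "running",
}


def oid_key(oid):
    """Sort key that orders OIDs numerically, component by component."""
    return tuple(int(part) for part in oid.split("."))


def snmp_get(oid):
    """GET needs the exact instance OID; anything else has no value."""
    if oid not in MIB:
        raise KeyError("noSuchName: " + oid)
    return MIB[oid]


def snmp_walk(prefix):
    """WALK returns every leaf at or below the given prefix."""
    p = prefix.rstrip(".")
    hits = [o for o in MIB if o == p or o.startswith(p + ".")]
    return [(o, MIB[o]) for o in sorted(hits, key=oid_key)]


for oid, value in snmp_walk("1.3.6.1.4.1.16563.1"):
    print(oid, "=", value)
```

Calling `snmp_get("1.3.6.1.4.1.16563.1")` on this model raises the "no such name" error because the subtree root itself carries no value, while the walk from the same OID lists every leaf beneath it.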

#Monitoring #Caché

0 1

0 4.4K

Question

Hao Ma · May 24, 2021

Regarding Ensemble message process time

I believe most of you have encountered this problem: a Health Connect/Ensemble user gets a slow response and asks for a measurement of how long Ensemble took to process the request, but the Ensemble 'activity data' gives no clue about the delay.

The reason is that Health Connect message measurement is based on Ensemble messages, which cannot show when Ensemble received the request or when it sent back the response. When there is a delay in the inbound/outbound adapter or the CSP gateway, there is no way to find it from the "activity data".

#Monitoring #Ensemble #Health Connect

0 1

0 316

Question

Mary George · Nov 5, 2020

Global size monitoring

What is the best way to get the size and other storage details of individual Globals in a namespace?

Thanks,

Mary

#Globals #Monitoring #Caché #Ensemble #HealthShare #InterSystems IRIS for Health

2 4

1 818

Question

Han Ya · Sep 25, 2020

[help]SNMP Service can't connect with Caché SNMP agent(failed to open C:\InterSystems\Ensemble\mgr\snmpext.dat)

Whenever the Windows SNMP Service restarts, the snmpdbg log says the following.

16:58:25 :Debug tracing enabled for SNMP agent
16:58:25 :SnmpExtensionInit called, pid=4432, tid=12276
16:58:25 :CreateEvent for CacheSNMPTrap suceeded
16:58:25 :register Cache OID 1.3.6.1.4.1.16563.1
16:58:25 :Get all Cache configs ...
16:58:25 :found 1 configs
16:58:25 :Add ENSEMBLE config to list ...

#Monitoring #Ensemble

0 4

0 565

Question

Glenn van Bavel · May 4, 2017

SNMP Service can't connect with Caché SNMP agent

Whenever the Windows SNMP Service restarts, the snmpdbg log says the following.

13:08:59 :Attempting initial TCP connection(s) with 1 Cache instances ...
13:08:59 :Get connection with ENSEMBLE on port 1972
13:08:59 :Connection refused on port 1972, check if Cache instance ENSEMBLE is started.
13:08:59 :Cache iscsnmp.dll initialized for 1 configs

Ensemble and all productions are running. I've set up Caché SNMP agent on many other servers in our company and those are working fine. However this one server won't budge.

#Caché #Monitoring

0 10

0 1.3K

Question

Mary George · Sep 4, 2020

[Resolved] Task manager email notification

Hi,

I created a task in the Management Portal Task Manager to run the Ens.Util.Tasks.Purge task. The task setup includes email notification for both the completion email and the error email.

This task is giving an error and no email is generated:

#Monitoring #System Administration #HealthShare

1 7

0 695

Question

Oliver Wilms · Aug 19, 2020

Sample Code for Free Space Monitor Task

Hello,

I just watched the recording of Michael Brady's presentation on Ensemble Disk Free Space Monitoring. Is the sample code for the Task definition class still available? How can I obtain a copy?

Thanks

#Monitoring #Ensemble

0 2

0 305

Question

Henrique Dias · Jun 29, 2020

Calling Custom Sensors using ZPM

Hi,

During the implementation of iris-history-monitor using ZPM, I've run into the following scenario:

My Installer.cls has a call to the Custom Sensors class method. The custom information works like a charm, as I described in this article:

IRIS History Monitor using custom built-in REST API /api/monitor/metrics

But, now I'm trying to replicate the same behavior using the module.xml to work with ZPM.

#API #Monitoring #InterSystems IRIS

0 4

0 264

Question

Julian Matthews · Apr 27, 2020

Issues Monitoring Activity Volume

Hi all.

A long time ago I enabled Activity Monitoring to save myself headaches in the future when looking at the performance of various message routes through our productions. It has served its purpose of answering questions such as how many messages we process a week, but I had not had the chance to really dig into the stats for specific message types or destinations to pinpoint issues.

#Monitoring #Performance #Ensemble

0 3

0 302