Alertables

The tables below contain a guide of what fields are available for each of the different intervals and types.

Node Statistics

The following fields are applicable for both the per-minute and hourly node level data.

Field Type Platform Description
Timestamp String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
CPU (%) Number All CPU utilization is a key performance metric used in system monitoring and analysis, as it provides insights into how effectively the CPU is being used to perform tasks. High CPU utilization may indicate that the CPU is working at or near its maximum capacity, which could potentially lead to performance degradation or system slowdowns. Low CPU utilization, on the other hand, may indicate that the CPU is underutilized and there may be room for optimization to improve system performance.
CPU Wait (%) Number All CPU wait time is an important metric in system performance analysis, as it indicates the amount of time that a CPU is not being utilized efficiently. High CPU wait times can indicate performance bottlenecks, such as slow I/O operations, inefficient memory access, or contention for system resources.
Memory (MB) Number All Memory (RAM) is used to temporarily store data and instructions that are being actively used or processed by the CPU of a system. RAM is a critical component of a computer system that plays a crucial role in enabling the CPU to quickly access and process data and instructions, facilitating the smooth operation of an operating system and running applications.
Buffer Cache Memory (MB) Number Linux only The Linux buffer cache is a part of the Linux kernel's file system caching mechanism, which aims to improve I/O performance by caching frequently accessed data in memory. It is an important feature of the Linux file system that helps improve overall system performance by reducing the latency of data access and minimizing the amount of I/O operations that need to be performed on storage devices.
Swap (MB) Number All Swap space is typically a portion of disk that is reserved for when a system runs out of physical memory. The disk space is used as an extension of the system's physical memory. The data that is swapped out to the swap space is stored in a special format called a swap file or swap partition, and it can be read back into RAM when it is needed again.
Total Memory (MB) Number All Total Memory is the size, in Megabytes of the available Memory on the system. Please see Memory for a more detailed description of its purpose.

Session Statistics

Field Type Platform Description
Timestamp String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
CPU (%) Number All This metric is used to measure the amount of CPU resources that is being consumed by a process. It can help identify processes that are using excessive CPU resources or causing performance issues. CPU utilization at the session level is measured as a percentage of a single CPU core. This means that this number can go above 100%.
Memory (MB) Number All This metric is used to measure the amount of RAM resources that are being consumed by a process, and can help identify processes that are using excessive memory resources or causing performance issues.
Data Read (MB) Number All This metric is used to measure the amount of I/O resources that are being consumed by a process, and can help identify processes that are reading excessive amounts of data from storage devices or causing I/O bottlenecks.
Data Written (MB) Number Linux, Windows This metric is used to measure the amount of I/O resources that are being consumed by a process, and can help identify processes that are reading excessive amounts of data from storage devices or causing I/O bottlenecks.
Job Name String All The name of the session as it is registered within the application.
Log Lines Number All Where a log file is associated with a process, the metric refers to the count of log messages that are recorded in a log file. Monitoring the number of log lines written in one minute can be useful for understanding the activity level of a system or application, as well as for troubleshooting issues or identifying patterns of behavior that may indicate a problem.
Major Faults Number Linux, Windows This metric refers to the count of hard faults. A hard fault occurs when a process attempts to access a page of memory that is not currently resident in RAM and must be loaded from disk. When a major fault occurs, the operating system must retrieve the necessary data from disk and load it into memory before allowing the process to continue. Monitoring major faults can be useful for identifying potential performance bottlenecks or issues with memory usage. If a process is generating a large number of major faults, it may indicate that it is frequently accessing data that is not currently resident in RAM, which can slow down overall system performance.
Minor Faults Number Linux, Windows This metric refers to the count of soft faults. A soft fault occurs when a process requests data from virtual memory that is not currently in physical memory, but is still present in the system's page cache or other buffers. Minor faults are a normal and expected part of the virtual memory management system used by modern operating systems, and are typically not a cause for concern.
Memory Share Number All This metric refers to the amount of memory that is shared between a specific process and other processes running on a computer system. Shared memory allows processes to communicate and share data with each other without the need for inter-process communication (IPC) mechanisms such as message passing or file I/O. When a process uses shared memory, a portion of its memory space is mapped to a common memory region that can be accessed by other processes. This shared memory can be used to share data or communicate between processes in a fast and efficient manner.
Peak CPU (%) Number All This metric refers to the maximum utilization of CPU for a process on a system in the given period of time.
Session Type String All This is the type of session as categorized within the application.
Total Data Read (MB) Number All This metric refers to the total amount of data read, to any device, by a process up to this period of time.
Total Data Written (MB) Number Linux, Windows This metric refers to the total amount of data written, to any device, by a process during the specified period of time.
User CPU (%) Number Linux, Windows This metric refers to the percentage of CPU that is being used by a process to execute user-level instructions.
System CPU (%) Number Linux, Windows This metric refers to the percentage of CPU that is being for system-level purposes. System-level processes are typically those that are responsible for managing the system's resources and providing essential services to user-level processes.
UTILLOC Size (MB) Number All This field is used to store two different numbers depending on the session's type. It can either be the size of SAS' UTILLOC which is a temporary location which enables threaded applications to store utility files. Or, it can be the size of the in-memory and disk-based CAS cache.
Virtual Memory (MB) Number All Virtual memory is the amount of memory that is allocated to a specific process running on a computer system. Virtual memory is a technique used by operating systems to allow a process to use more memory than is physically available on the system by temporarily transferring data to and from disk storage.
WORK Size (MB) Number All This field is used to store two different numbers depending on the session's type. It can either be the size of SAS' WORK location which is a temporary location used to store intermediate results during data processing. Or, it can be the size of the in-memory CAS cache.

CAS Action Executions

Field Type Platform Description
Timestamp String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
Action Set String All The action set is the parent group of a particular set of CAS actions. Examples include: clustering, deepLearn, regression, and svm
Action Name String All The action name is the name of the CAS action that is being executed. Examples include: kClus, dlTrain, logistic, and svmTrain
Action Details String All The details, if available of the action. This information may include options passed to the action. This data is returned as a string formatted as JSON.
Process PID Number All The process ID of the CAS session where the action was executed.
Start Timestamp String All The start time of the execution of the action.
End Timestamp String All The end time of the execution of the action.
User String All The user that executed the action.

SAS Procedure Executions

Field Type Platform Description
Timestamp String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
Proc Name String All The name of the SAS procedure that was executed.
Real Time Number All The duration of the procedure execution
Process PID Number All The process ID of the SAS session where the procedure was executed.
Start Timestamp String All The start time of the execution of the procedure.
End Timestamp String All The end time of the execution of the procedure.
User String All The user that executed the procedure.

Dataset Access Events

Field Type Platform Description
Timestamp String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
Access Time String All The date and time at which the dataset was accessed by the session.
Engine String All The SAS/ACCESS engine that was used to access the dataset.
Libref String All In SAS, a libref is a shorthand name that is assigned to a SAS library.
Mode String All The mode in which the dataset was accessed. Values are: READ, WRITE, and UPDATE.
Path String All The location of the data. For file based data like SAS7BDAT file, this will be the folder in which the dataset is stored, for databases, this could be the IP address/hostname of the database server or the Data Source Name (DSN).
Process PID String All The process ID of the SAS session which access the data
Table String All The name of the table that was accessed.

Log Events

Field Type Platform Description
Date & Time String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
Level String All The level of detail or severity of the message generated. Values can be: trace, debug, info, warn, error, severe, and fatal.
Line Number Number All The line number within the log file that the associated message is found.
Message String All An entry that captures information about an event or activity that has occurred within an application or system.
Timestamp String All The exact time that the message was written to the log.

Session Summary Statistics

Field Type Platform Description
Date & Time String All The date and time at for which the alert is generated.
Hostname String All The hostname of a device is a human-readable name that is used to identify the device on the network, rather than using its IP address.
Average CPU (%) Number All Measures the average CPU utilization for the lifetime of the session.
Average Memory (MB) Number All Measure the average amount of memory (RAM) used during the lifetime of the session.
Container ID String All For Kubernetes based deployments, this is the unique ID of the container in which the session was running.
Job Name String All The name of the session as it appears in the application.
Namespace String All For Kubernetes based deployments, this is the namespace in which the session (pod/container) ran.
Parent PID Number All The parent process of the session.
Peak CPU (%) Number All The maximum utilization of CPU for a session during its lifetime.
Peak Memory Number All The maximum utilization of memory for a session during its lifetime.
Peak UTILLOC Number All The maximum utilization of UTILLOC for a session during its lifetime.
Peak WORK Number All The maximum utilization of WORK for a session during its lifetime.
Pod Name String All For Kubernetes deployments, this is the unique name of the pod in which the session ran.
Priority Number All The numerical representation of the priority with which the session was started.
Processor Number All The processor ID of the processor on which the session was initially scheduled.
Queue String All For SAS Grid deployments, this is the LSF queue that the session was executed in.
Session Type String All The category of process that this session belongs to.
Total Data Read (MB) Number All The total amount of data read, from any device, during the lifetime of the session.
Total Data Written (MB) Number All The total amount of data written, to any device, during the lifetime of the session.
Total Duration (ms) Number All The total duration of the session in milliseconds.
Total Log Lines Number All The total number of log lines written during the lifetime of the session.
User String All The username of the person that owned the session.

Last update: April 24, 2023
Created: April 24, 2023