Skip to content

Status Page

All Systems Operational

About This Page

This webpage is dedicated to providing live updates during service interruptions and advanced notifications for planned maintenance. At Therma°, we strive to provide accurate, up-to-date information on our services so you're never left guessing about the connection of your hardware. Our unwavering commitment to accurate data and transparent communication with our valued clients is paramount to us.

Therma° Server
tooltip

This refers to our main server used for collecting and reporting data from your hardware to the Therma° Dashboard. 

Operational
Cellular
tooltip

Our Therma° Monitoring Hubs transmit data through an international SIM card. If the cellular server is compromised, this status will update. 

Operational
IOS App
tooltip

This status refers to the operations of our IOS application.

Operational
Android App
tooltip

This status refers to the operations of our Android application.

Operational
Smart Thermostat
tooltip

Check here to see connection statuses of our IoT Thermostats. 

Operational
0

Upcoming Maintenance

2

Incidents - Last 30 Days

0

Maintenance - Last 30 Days

History

Therma° Monitoring Outage

Description

A critical component of the Monitoring application, RabbitMQ, broke down resulting in dependent microservices failing.

Timeline

October 18th, 2023, 10:28 AM UTC - Issue emerged.
October 18th, 2023, 6:58 PM UTC - Issue resolved.

October 18th, 2023, 6:58 PM UTC

[Update] Completed a comprehensive check of sensor connectivity. Issue confirmed fully resolved. 

October 18th, 2023, 6:24 PM UTC

[Update] Teams completed the recreation of the RabbitMQ shovels and restored connectivity for affected monitoring sensors. 

October 18th, 2023, 2:50 PM UTC

[Update] Issue identified. Teams are working to recreate RabbitMQ to the stable cluster to restore connectivity. 

October 18th, 2023, 10:28 AM UTC

[Update] Internal alerts notified teams of failing micro-services. Investigation launched. 

CoInspect Server Outage

Description

The Kubernetes cluster for production CoInspect was not able to create workers nodes which led to CoInspect servers being unreachable. This was caused by a combination of spot termination and a misconfiguration.

Timeline

October 17th, 2023, 9:59 AM UTC - Issue emerged.

October 17th, 2023, 12:10 PM UTC - Issue resolved.

October 17th, 2023, 12:10 PM UTC

[Update] This incident has been resolved, and the connection to servers has been fully restored. 

October 17th, 2023, 10:46 AM UTC 

[Update] The cause of the incident was identified, and the team started the rollout of the solution. 

October 17th, 2023, 9:59 AM UTC

[Update] Server health probes failed, which triggered an alert, notifying our IT team of the issue. Investigation launched. 

Cellular Outage

Description

A cut fiber cable resulted in a lack of cellular connectivity for Therma° Hubs. 

Timeline

September 19th, 2023, 4:36 pm UTC - Issue emerged.

September 19th, 2023, 10:01 pm UTC - Issue resolved.

September 19th, 2023, 10:01 pm UTC

[Update] This incident has been resolved, and the connection has been fully restored. 

September 19th, 2023, 8:01 pm UTC

[Update] After reviewing different approaches with our upstream partners, they have started routing traffic via a different link. 

September 19th, 2023, 5:29 pm UTC

[Update] Our upstream partners have traced the incident to a cut fiber cable and are working to reroute traffic. 

September 19th, 2023, 4:36 pm UTC

 

[Update] An Alert from one of our cellular partners notified our teams of the issue. Investigation launched with upstream partners. 

Cellular Outage

Description

An unexpected SIM outage occurred within our partner’s cellular network, affecting multiple hub devices on our network. This incident led to a disruption in Therma services. Our devices were still monitoring and reporting temperatures per equipment, but they were not being passed into our dashboard or sending temperature-related alerts.

Timeline

August 28th, 2023, 19:39 UTC - Issue emerged.

August 31st, 2023, 5:30 UTC - Issue resolved.

August 31st, 2023, 5:30 am UTC

[Update] Issue has been resolved.

August 30th, 2023, 7:30 am UTC

[Update] We observed devices having issues connecting due to congestion of incoming data during the recovery process. Cellular traffic slowly increased for online hardware. 

August 30th, 2023, 5:00 am UTC

 

[Update] After partner implemented a repeat of prior workarounds and traffic mitigations with the help of the interconnect provider, the interconnect provider now reverted the erroneous configuration change in their affected data center.

August 29th, 2023,  5:36 am UTC

[Update] Following failed attempts to restore service, including replacing signaling hardware links that were alerting and implementing temporary workarounds to increase signaling performance and capacity, partner executes their crisis management plan.

August 28th, 2023, 7:39 pm UTC

[Update] Issue has been identified. Upstream connectivity partners paged on-call engineering support, and additional engineering support was resourced to investigate and resolve the outage.