Using artificial intelligence to find anomalies hiding in massive datasets

Identifying a malfunction in the nation’s power grid can be like trying to find a needle in an enormous haystack. Hundreds of thousands of interrelated sensors spread across the U.S. capture data on electric current, voltage, and other critical information in real time, often taking multiple recordings per second.

Researchers at the MIT-IBM Watson AI Lab have devised a computationally efficient method that can automatically pinpoint anomalies in those data streams in real time. They demonstrated that their artificial intelligence method, which learns to model the interconnectedness of the power grid, is much better at detecting these glitches than some other popular techniques.

Because the machine-learning model they developed does not require annotated data on power grid anomalies for training, it would be easier to apply in real-world situations where high-quality, labeled datasets are often hard to come by. The model is also flexible and can be applied to other situations where a vast number of interconnected sensors collect and report data, like traffic monitoring systems. It could, for example, identify traffic bottlenecks or reveal how traffic jams cascade.

“In the case of a power grid, people have tried to capture the data using statistics and then define detection rules with domain knowledge to say that, for example, if the voltage surges by a certain percentage, then the grid operator should be alerted. Such rule-based systems, even empowered by statistical data analysis, require a lot of labor and expertise. We show that we can automate this process and also learn patterns from the data using advanced machine-learning techniques,” says senior author Jie Chen, a research staff member and manager of the MIT-IBM Watson AI Lab.

The co-author is Enyan Dai, an MIT-IBM Watson AI Lab intern and graduate student at the Pennsylvania State University. This research will be presented at the International Conference on Learning Representations.

Probing probabilities

The researchers began by defining an anomaly as an event that has a low probability of occurring, like a sudden spike in voltage. They treat the power grid data as a probability distribution, so if they can estimate the probability densities, they can identify the low-density values in the dataset. The data points that are least likely to occur correspond to anomalies.
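The idea of flagging low-density points can be illustrated with a minimal sketch. It assumes a single multivariate Gaussian as the density model, which is far simpler than the researchers' model, and the sensor values, the injected spike, and the 1 percent cutoff are all hypothetical.

```python
import numpy as np

def gaussian_log_density(x, mean, cov):
    """Log density of each row of x under a multivariate Gaussian."""
    d = mean.shape[0]
    diff = x - mean
    # Solve a linear system instead of inverting the covariance explicitly.
    solved = np.linalg.solve(cov, diff.T).T
    mahalanobis = np.sum(diff * solved, axis=1)
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (d * np.log(2 * np.pi) + logdet + mahalanobis)

def flag_low_density(x, quantile=0.01):
    """Return a mask of the lowest-density rows of x, plus all log densities."""
    mean = x.mean(axis=0)
    cov = np.cov(x, rowvar=False)
    log_p = gaussian_log_density(x, mean, cov)
    threshold = np.quantile(log_p, quantile)  # hypothetical cutoff
    return log_p <= threshold, log_p

rng = np.random.default_rng(0)
# Hypothetical readings: 1,000 samples of (voltage, current) near nominal values.
readings = rng.normal(loc=[120.0, 15.0], scale=[0.5, 0.2], size=(1000, 2))
readings[500] = [135.0, 15.0]  # injected voltage spike (assumption for the demo)

mask, log_p = flag_low_density(readings)
spike_flagged = bool(mask[500])
lowest_index = int(np.argmin(log_p))
```

No labels are used anywhere in this sketch: the cutoff is set from the estimated densities alone, which is what makes the approach unsupervised.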

Estimating those probabilities is no easy task, especially since each sample captures multiple time series, and each time series is a set of multidimensional data points recorded over time. Plus, the readings from the sensors that capture all that data are conditionally dependent on one another, meaning the sensors are connected in a certain configuration and one sensor can sometimes impact others.

To learn the complex conditional probability distribution of the data, the researchers used a special type of deep-learning model called a normalizing flow, which is particularly effective at estimating the probability density of a sample.
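A normalizing flow computes a density by mapping each sample through an invertible transformation to a simple base distribution and correcting for how the transformation stretches space. The sketch below shows that change-of-variables rule for the simplest possible case, a one-dimensional affine map with a standard normal base. The shift and scale are fixed, hypothetical values; a real flow stacks many such invertible layers and learns their parameters from data.

```python
import numpy as np

def standard_normal_log_pdf(z):
    """Log density of the base distribution, a standard normal."""
    return -0.5 * (z ** 2 + np.log(2 * np.pi))

def affine_flow_log_density(x, shift, log_scale):
    """log p(x) by change of variables for z = (x - shift) * exp(-log_scale)."""
    z = (x - shift) * np.exp(-log_scale)
    # For this one-dimensional affine map, log |dz/dx| = -log_scale.
    log_det_jacobian = -log_scale
    return standard_normal_log_pdf(z) + log_det_jacobian

# Hypothetical voltage readings scored with hypothetical flow parameters.
x = np.array([119.0, 120.0, 135.0])
log_p = affine_flow_log_density(x, shift=120.0, log_scale=np.log(0.5))

# Closed-form log density of N(120, 0.5^2), used only as a sanity check.
closed_form = (-0.5 * ((x - 120.0) / 0.5) ** 2
               - np.log(0.5) - 0.5 * np.log(2 * np.pi))
```

The reading of 135 receives a far lower log density than the readings near 120, which is the signal an anomaly detector would act on.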

They augmented that normalizing flow model using a type of graph, known as a Bayesian network, which can learn the complex, causal relationship structure between different sensors. This graph structure enables the researchers to see patterns in the data and detect anomalies more accurately, Chen explains.

“The sensors are interacting with each other, and they have causal relationships and depend on each other. So, we have to be able to inject this dependency information into the way that we compute the probabilities,” he says.

This Bayesian network factorizes, or breaks down, the joint probability of the multiple time series data into less complex, conditional probabilities that are much easier to parameterize, learn, and evaluate. This allows the researchers to estimate the likelihood of observing certain sensor readings, and to identify those readings that have a low probability of occurring, meaning they are anomalies.
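The factorization can be made concrete with a small sketch: the joint log density is the sum, over sensors, of each sensor's log density given its parents in the graph. The three-sensor graph, the linear-Gaussian conditionals, and every number below are hypothetical stand-ins; the researchers use normalizing flows for the conditionals rather than this simple form.

```python
import numpy as np

# Hypothetical three-sensor network: s0 -> s1, and both s0 and s1 -> s2.
PARENTS = {0: [], 1: [0], 2: [0, 1]}
# Hypothetical conditionals: x_i ~ Normal(bias + weights . x_parents, std^2).
WEIGHTS = {0: np.array([]), 1: np.array([0.8]), 2: np.array([0.5, -0.3])}
BIAS = {0: 0.0, 1: 0.1, 2: -0.2}
STD = {0: 1.0, 1: 0.5, 2: 0.4}

def normal_log_pdf(x, mean, std):
    return -0.5 * ((x - mean) / std) ** 2 - np.log(std) - 0.5 * np.log(2 * np.pi)

def conditional_log_density(i, x):
    """log p(x_i | parents of x_i) under the hypothetical conditionals."""
    mean = BIAS[i] + float(np.dot(WEIGHTS[i], x[PARENTS[i]]))
    return normal_log_pdf(x[i], mean, STD[i])

def joint_log_density(x):
    """log p(x) = sum over sensors of log p(x_i | parents of x_i)."""
    return sum(conditional_log_density(i, x) for i in PARENTS)

typical = np.array([0.2, 0.3, -0.2])
odd = np.array([0.2, 3.0, -0.2])  # sensor 1 disagrees with its parent, sensor 0

per_sensor_odd = [conditional_log_density(i, odd) for i in PARENTS]
```

Each conditional involves only a sensor and its parents, so it has far fewer inputs than the full joint distribution. In this toy setup, the individual terms also show which sensor's reading is least consistent with its parents.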

Their method is especially powerful because this complex graph structure does not need to be defined in advance; the model can learn the graph on its own, in an unsupervised manner.
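The article does not say how the graph is learned. One ingredient that is widely used in the structure-learning literature, shown here purely as an assumption about how such learning can work, is a smooth score that equals zero only when a weighted adjacency matrix describes a graph with no cycles, as a Bayesian network requires. An optimizer can then adjust the edge weights while pushing this score toward zero. The function name and example matrices below are hypothetical.

```python
import numpy as np

def acyclicity_score(adjacency):
    """Sum of trace((A * A)^k) / k! for k = 1..d, where d is the node count.

    This is the series for trace(exp(A * A)) - d truncated at k = d.
    It is zero exactly when the weighted graph has no directed cycle.
    """
    d = adjacency.shape[0]
    squared = adjacency * adjacency  # elementwise, so every entry is non-negative
    power = np.eye(d)
    factorial = 1.0
    score = 0.0
    for k in range(1, d + 1):
        power = power @ squared  # entry (i, j) weighs the k-step walks from i to j
        factorial *= k
        score += np.trace(power) / factorial  # the trace picks up closed walks
    return score

# Hypothetical edge weights: entry (i, j) is the strength of the edge i -> j.
dag = np.array([[0.0, 0.9, 0.4],
                [0.0, 0.0, -0.7],
                [0.0, 0.0, 0.0]])
cyclic = dag.copy()
cyclic[2, 0] = 0.5  # adds the edge 2 -> 0, which closes a cycle

dag_score = acyclicity_score(dag)
cyclic_score = acyclicity_score(cyclic)
```

Because any cycle in a graph with d nodes has length at most d, stopping the series at k = d is enough to detect it.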

A powerful technique

They tested this framework by seeing how well it could identify anomalies in power grid data, traffic data, and water system data. The datasets they used for testing contained anomalies that had been identified by humans, so the researchers were able to compare the anomalies their model identified with real glitches in each system.

Their model outperformed all the baselines by detecting a higher percentage of true anomalies in each dataset.
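Read literally, "a higher percentage of true anomalies" corresponds to recall: the fraction of human-labeled anomalies that the model flags. The article does not name the exact metric the researchers reported, so the sketch below is only an illustration of how flagged points can be compared with labels, using made-up data. Precision is included because a detector that flags everything would score perfect recall.

```python
import numpy as np

def detection_rates(predicted, actual):
    """Return (recall, precision) for boolean anomaly flags against labels."""
    predicted = np.asarray(predicted, dtype=bool)
    actual = np.asarray(actual, dtype=bool)
    true_positives = np.sum(predicted & actual)
    # max(..., 1) avoids division by zero when there are no anomalies or no flags.
    recall = true_positives / max(np.sum(actual), 1)
    precision = true_positives / max(np.sum(predicted), 1)
    return float(recall), float(precision)

# Hypothetical labels (1 = anomaly identified by a human) and model flags.
actual = [0, 0, 1, 0, 1, 0, 0, 1]
predicted = [0, 0, 1, 0, 0, 0, 1, 1]

recall, precision = detection_rates(predicted, actual)
```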

“For the baselines, a lot of them don’t incorporate graph structure. That perfectly corroborates our hypothesis. Figuring out the dependency relationships between the different nodes in the graph is definitely helping us,” Chen says.

Their methodology is also flexible. Armed with a large, unlabeled dataset, they can tune the model to make effective anomaly predictions in other situations, like traffic patterns.

Once the model is deployed, it would continue to learn from a steady stream of new sensor data, adapting to possible drift of the data distribution and maintaining accuracy over time, says Chen.

Though this particular project is close to its end, he looks forward to applying the lessons he learned to other areas of deep-learning research, particularly on graphs.

Chen and his colleagues could use this approach to develop models that map other complex, conditional relationships. They also want to explore how they can efficiently learn these models when the graphs become enormous, perhaps with millions or billions of interconnected nodes. And rather than finding anomalies, they could also use this approach to improve the accuracy of forecasts based on datasets or streamline other classification techniques.

This work was funded by the MIT-IBM Watson AI Lab and the U.S. Department of Energy.

International Conference on Learning Representations article: https://openreview.net/forum?id=45L_dgP48Vd
