日本語
 
Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions

Li, Z., Zhao, Y., Hu, X., Botta, N., Ionescu, C., & Chen, G. H. (2022). ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions. IEEE Transactions on Knowledge and Data Engineering. doi:10.1109/TKDE.2022.3159580.

Item is

基本情報

表示: 非表示:
資料種別: 学術論文

ファイル

表示: ファイル
非表示: ファイル
:
Li, Botta et al. ECOD Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions.pdf (全文テキスト(全般)), 4MB
 
ファイルのパーマリンク:
-
ファイル名:
Li, Botta et al. ECOD Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions.pdf
説明:
-
閲覧制限:
非公開
MIMEタイプ / チェックサム:
application/pdf
技術的なメタデータ:
著作権日付:
-
著作権情報:
-
CCライセンス:
-

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Li, Zheng1, 著者
Zhao, Yue1, 著者
Hu, Xiyang1, 著者
Botta, Nicola2, 著者              
Ionescu, Cezar2, 著者
Chen, George H.1, 著者
所属:
1External Organizations, ou_persistent22              
2Potsdam Institute for Climate Impact Research, ou_persistent13              

内容説明

表示:
非表示:
キーワード: outlier detection; anomaly detection; distributed learning; scalability; empirical cumulative distribution function
 要旨: Outlier detection refers to the identification of data points that deviate from a general data distribution. Existing unsupervised approaches often suffer from high computational cost, complex hyperparameter tuning, and limited interpretability, especially when working with large, high-dimensional datasets. To address these issues, we present a simple yet effective algorithm called ECOD (Empirical-Cumulative-distribution-based Outlier Detection), which is inspired by the fact that outliers are often the “rare events” that appear in the tails of a distribution. In a nutshell, ECOD first estimates the underlying distribution of the input data in a nonparametric fashion by computing the empirical cumulative distribution per dimension of the data. ECOD then uses these empirical distributions to estimate tail probabilities per dimension for each data point. Finally, ECOD computes an outlier score of each data point by aggregating estimated tail probabilities across dimensions. Our contributions are as follows: (1) we propose a novel outlier detection method called ECOD, which is both parameter-free and easy to interpret; (2) we perform extensive experiments on 30 benchmark datasets, where we find that ECOD outperforms 11 state-of-the-art baselines in terms of accuracy, efficiency, and scalability; and (3) we release an easy-to-use and scalable (with distributed support) Python implementation for accessibility and reproducibility.

資料詳細

表示:
非表示:
言語: eng - 英語
 日付: 2022-03-052022-03-16
 出版の状態: オンラインで出版済み
 ページ: -
 出版情報: -
 目次: -
 査読: 査読あり
 識別子(DOI, ISBNなど): DOI: 10.1109/TKDE.2022.3159580
PIKDOMAIN: RD4 - Complexity Science
Organisational keyword: RD4 - Complexity Science
Model / method: Machine Learning
MDB-ID: No data to archive
 学位: -

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物 1

表示:
非表示:
出版物名: IEEE Transactions on Knowledge and Data Engineering
種別: 学術雑誌, SCI, Scopus
 著者・編者:
所属:
出版社, 出版地: -
ページ: - 巻号: - 通巻号: - 開始・終了ページ: - 識別子(ISBN, ISSN, DOIなど): CoNE: https://publications.pik-potsdam.de/cone/journals/resource/transactions-knowledge-data-engineering
Publisher: Institute of Electrical and Electronics Engineers (IEEE)