ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution 
Functions

Li, Zheng; Zhao, Yue; Hu, Xiyang; Botta, Nicola; Ionescu, Cezar; Chen, George H.

doi:10.1109/TKDE.2022.3159580

???ViewItemFull_btnItemView??????ViewItemOverview_lblLinkOverviewPage???

ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions

Li, Z., Zhao, Y., Hu, X., Botta, N., Ionescu, C., Chen, G. H. (2023): ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions. - IEEE Transactions on Knowledge and Data Engineering, 35, 12, 12181-12193.
https://doi.org/10.1109/TKDE.2022.3159580

Item is ???ENUM_STATE_RELEASED???

???ViewItemFull_lblShowGroup??? ???ViewItemFull_lblAll??? ???ViewItemFull_lblHideGroup??? ???ViewItemFull_lblAll???

???ViewItemFull_lblBasic???

???ViewItemFull_lblShowGroup??? ???ViewItemFull_lblHideGroup???

???ViewItemFull_lblCiteItemAs???: https://publications.pik-potsdam.de/pubman/item/item_26927 ???ViewItemFull_lblCiteItemVersionAs???: https://publications.pik-potsdam.de/pubman/item/item_26927_4

???ViewItemFull_lblGenre???: ???ENUM_GENRE_ARTICLE???

???ViewItemMedium_lblSubHeaderFile???

???ViewItemFull_lblShowGroup??? ???ViewItemMedium_lblSubHeaderFile???

???ViewItemFull_lblHideGroup??? ???ViewItemMedium_lblSubHeaderFile???

:

Li, Botta et al. ECOD Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions.pdf (???ENUM_CONTENTCATEGORY_any-fulltext???), 4???ViewItemMedium_lblFileSizeMB???

???ViewItemFull_lblCiteFileAs???:
???lbl_noEntry???

???ViewItemMedium_lblFileName???:
Li, Botta et al. ECOD Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions.pdf

???ViewItemMedium_lblFileDescription???:
???lbl_noEntry???

???ViewItemMedium_lblFileOaSatus???:

???ViewItemMedium_lblFileVisibility???:
???ENUM_VISIBILITY_PRIVATE???

???ViewItemFull_lblFileMimeTypeSize???:
application/pdf

???ViewItemFull_lblTechnicalMetadata???:

???ViewItem_lblCopyrightDate???:
???lbl_noEntry???

???ViewItem_lblCopyrightInfo???:
???lbl_noEntry???

???ViewItemFull_lblFileLicense???:
???lbl_noEntry???

???ViewItemFull_lblSubHeaderLocators???

???ViewItemFull_lblShowGroup???

???ViewItemFull_lblCreators???

???ViewItemFull_lblShowGroup???

???ViewItemFull_lblHideGroup???

???ViewItemFull_lblCreators???:
Li, Zheng¹, ???ENUM_CREATORROLE_AUTHOR???
Zhao, Yue¹, ???ENUM_CREATORROLE_AUTHOR???
Hu, Xiyang¹, ???ENUM_CREATORROLE_AUTHOR???
Botta, Nicola², ???ENUM_CREATORROLE_AUTHOR???
Ionescu, Cezar¹, ???ENUM_CREATORROLE_AUTHOR???
Chen, George H.¹, ???ENUM_CREATORROLE_AUTHOR???

???ViewItemFull_lblAffiliations???:
1External Organizations, ou_persistent22
2Potsdam Institute for Climate Impact Research, ou_persistent13

???EditItem_lblContent???

???ViewItemFull_lblShowGroup???

???ViewItemFull_lblHideGroup???

???ViewItemFull_lblSubject???: outlier detection; anomaly detection; distributed learning; scalability; empirical cumulative distribution function

???ViewItemFull_lblAbstract???: Outlier detection refers to the identification of data points that deviate from a general data distribution. Existing
unsupervised approaches often suffer from high computational cost, complex hyperparameter tuning, and limited interpretability,
especially when working with large, high-dimensional datasets. To address these issues, we present a simple yet effective algorithm
called ECOD (Empirical-Cumulative-distribution-based Outlier Detection), which is inspired by the fact that outliers are often the “rare
events” that appear in the tails of a distribution. In a nutshell, ECOD first estimates the underlying distribution of the input data in a
nonparametric fashion by computing the empirical cumulative distribution per dimension of the data. ECOD then uses these empirical
distributions to estimate tail probabilities per dimension for each data point. Finally, ECOD computes an outlier score of each data point
by aggregating estimated tail probabilities across dimensions. Our contributions are as follows: (1) we propose a novel outlier detection
method called ECOD, which is both parameter-free and easy to interpret; (2) we perform extensive experiments on 30 benchmark
datasets, where we find that ECOD outperforms 11 state-of-the-art baselines in terms of accuracy, efficiency, and scalability; and (3)
we release an easy-to-use and scalable (with distributed support) Python implementation for accessibility and reproducibility.

???ViewItemFull_lblSubHeaderDetails???

???ViewItemFull_lblShowGroup???

???ViewItemFull_lblHideGroup???

???ViewItemFull_lblLanguages???: eng - English

???ViewItemFull_lblDates???: ???ViewItem_lblDateAccepted???: 2022-03-05???ViewItem_lblDatePublishedOnline???: 2022-03-16???ViewItem_lblDatePublishedInPrint???: 2023-12-01

???ViewItemFull_lblPublicationStatus???: ???ViewItem_lblPublicationState_published-in-print???

???ViewItemFull_lblPages???: ???lbl_noEntry???

???ViewItemFull_lblPublishingInfo???: ???lbl_noEntry???

???ViewItemFull_lblTOC???: ???lbl_noEntry???

???ViewItemFull_lblRevisionMethod???: ???ENUM_REVIEWMETHOD_PEER???

???ViewItemFull_lblIdentifiers???: ???ENUM_IDENTIFIERTYPE_DOI???: 10.1109/TKDE.2022.3159580
???ENUM_IDENTIFIERTYPE_PIKDOMAIN???: RD4 - Complexity Science
???ENUM_IDENTIFIERTYPE_ORGANISATIONALK???: RD4 - Complexity Science
???ENUM_IDENTIFIERTYPE_MODELMETHOD???: Machine Learning
???ENUM_IDENTIFIERTYPE_MDB_ID???: No data to archive

???ViewItemFull_lblDegreeType???: ???lbl_noEntry???

???ViewItemFull_lblSourceTitle???: IEEE Transactions on Knowledge and Data Engineering

???ViewItemFull_lblSourceGenre???: ???ENUM_GENRE_JOURNAL???, SCI, Scopus

???ViewItemFull_lblSourceCreators???:

???ViewItemFull_lblSourceAffiliations???:

???ViewItemFull_lblSourcePubInfo???: ???lbl_noEntry???

???ViewItemFull_lblPages???: ???lbl_noEntry??? ???ViewItemFull_lblSourceVolumeIssue???: 35 (12) ???ViewItemFull_lblSourceSequenceNo???: ???lbl_noEntry??? ???ViewItemFull_lblSourceStartEndPage???: 12181 - 12193 ???ViewItemFull_lblSourceIdentifier???: ???ENUM_IDENTIFIERTYPE_CONE???: https://publications.pik-potsdam.de/cone/journals/resource/transactions-knowledge-data-engineering
???ENUM_IDENTIFIERTYPE_PUBLISHER???: Institute of Electrical and Electronics Engineers (IEEE)

???ViewItemPage???

???ViewItemFull_lblBasic???

???ViewItemMedium_lblSubHeaderFile???

???ViewItemFull_lblSubHeaderLocators???

???ViewItemFull_lblCreators???

???EditItem_lblContent???

???ViewItemFull_lblSubHeaderDetails???

???ViewItemFull_lblSubHeaderEvent???

???ViewItemFull_lblSubHeaderLegalCase???

???g_project_info???

???ViewItemFull_lblSubHeaderSource??? 1