网友您好，请在下方输入框内输入要搜索的题目：

搜题

题目内容（请给出正确答案）

提问人：网友okboy09 发布时间：2022-01-07

[主观题]

An outlier is a data object that deviates significantly from the rest of the objects, as if it were generated by a different mechanism.

简答题官方参考答案（由简答题聘请的专业题库老师提供的解答）

抱歉！暂无答案，正在努力更新中……

更多“An outlier is a data object that deviates significantly from the rest of the objects, as if it were …”相关的问题

第1题

Distance-based outlier Mining is not suitable to data set that does not fit any standard distribution model.

点击查看答案

第2题

Data mining is an（66)research field in database and artificial intelligence. In this paper

Data mining is an(66)research field in database and artificial intelligence. In this paper, the data mining techniques are introduced broadly including its producing background, its application and its classification. The principal techniques used in the data mining are surveyed also, which include rule induction, decision(67), artificial(68)network, genetic algorithm, fuzzy technique, rough set and visualization technique. Association rule mining, classification rule mining, outlier mining and clustering method are discussed in detail. The research achievements in association rule, the shortcomings of association rule measure standards and its(69), the evaluation methods of classification rules are presented. Existing outlier mining approaches are introduced which include outlier mining approach based on statistics, distance-based outlier mining approach, data detection method for deviation, rule-based outlier mining approach and multi-strategy method. Finally, the applications of data mining to science research, financial investment, market, insurance, manufacturing industry and communication network management are introduced. The application(70)of data mining are described.

A．intractable

B．emerging

C．easy

D．scabrous

点击查看答案

第3题

Which one is wrong about clustering and outliers？

A、Clustering belongs to supervised learning.

B、Principles of clustering include maximizing intra-class similarity and minimizing interclass similarity.

C、Outlier analysis can be useful in fraud detection and rare events analysis.

D、Outlier means a data object that does not comply with the general behavior of the data.

点击查看答案

第4题

Assignment 6 - Outlier mining You are required to ...

Assignment 6 - Outlier mining You are required to use outlier mining methods to detect the outliers with given data sets. In a section of a city road, several cameras are set to collect the plate of vehicles from 2017-06-09 to 2017-06-12, as well as the date and time when passing the start point and the finish point. Travel time is calculated later. Time serial is another form of transformation from start time. So each instance contains 8 attributes, including serial number, license plate number, date and time passing start/end point, time serial and travel time. There are totally 4977 instances. You need to finish the following tasks. Task: (1) Use statistic-based approach to detect the outliers of travel time. Calculate the mean value and the variance of travel time. Write out the confidence interval. Take time serial as X-axis and the travel time as Y-axis. Plot the scatter diagram and mark the outliers you have recognized. (2) Use distance-based approach to detect the outliers of travel time. An object o in data set D is defined as an outlier with parameters r and π described as DB(r,π), if a fraction of the objects in D lie at a distance less than r from o is less than π, o is an outlier. Let parameter r vary from 0.1 to 0.3 with the step of 0.1, and π vary from 30 to 90 with the step of 30, find the outliers and the number of the outliers. You can use the Euclidian distance. (3) Use density-based approach to detect the outliers of travel time. With different k (from 3 to 400 with the step of 5), the number of neighbors, calculate the LOF for each data point. Set 2.0 as a threshold for LOF and an object is labeled as an outlier if its LOF exceeds 2.0. Firstly, take k value as X-axis and the number of outliers as Y-axis. Plot the line chart. Secondly, calculate the LOF for each data point and give the top 4 outliers. Use k=350 and the Euclidian distance.

点击查看答案

第5题

Outlier arithmetics such as Isolation Forest can be used to detect traffic incident.

点击查看答案

第6题

If you cannot find a reason for an outlier or remove it, you should use the mean and IQR to summarize the center and spread.

点击查看答案

第7题

Which of the following is most affected by an outlier （extreme value)？

A、mean

B、median

C、mode

D、none of the above

点击查看答案

第8题

What is application case of outlier mining？

A、Traffic incident detection

B、Credit card fraud detection

C、Network intrusion detection

D、Medical analysis

点击查看答案

第9题

在一个n维的空间中，最好的检测outlier（离群点)的方法是（)A.作正态分布概率图B.作盒形图C.马氏距

在一个n维的空间中，最好的检测outlier(离群点)的方法是()

A.作正态分布概率图

B.作盒形图

C.马氏距离

D.作散点图

点击查看答案

第10题

How to pick the right k by a heuristic method for density based outlier mining method？

A、K should be at least 10 to remove unwanted statistical fluctuations.

B、Pick 10 to 20 appears to work well in general.

C、Pick the upper bound value for k as the maximum of “close by” objects that can potentially be global outliers.

D、Pick the upper bound value for k as the maximum of “close by” objects that can potentially be local outliers.

点击查看答案

账号：尚未登录

登录没有账号？去注册

搜题记录

联系客服

购买搜题卡

考试指南全部 >

2024年自考本科考试时间具体时间安排一览表 11省2024年10月自考报名时间及考试时间一览表成人自考本科2024年报名时间什么时候开始报考 2024年自考全国统一考试时间安排具体开考时间是几号 2024想自学考试怎么报名在哪里报名 2024自考本科在哪个网站报名详细报考流程是什么 2024自考本科什么时候报名怎么报名 2024年中药学自考本科考哪几门合格标准是多少成人高考和自考哪个含金量高有什么区别自学考试报名时间安排2024年考哪些内容

购买搜题卡查看答案

购买前请仔细阅读《购买须知》

请选择支付方式

微信支付

支付宝支付

点击支付即表示你同意并接受《服务协议》和《购买须知》

立即支付已付款，但不能查看答案，请点这里登录即可>>

搜题卡使用说明

1. 搜题次数扣减规则：

功能	扣减规则
功能	基础费（查看答案）	加收费（AI功能）
文字搜题、查看答案	1/每题	0/每次
语音搜题、查看答案	1/每题	2/每次
单题拍照识别、查看答案	1/每题	2/每次
整页拍照识别、查看答案	1/每题	5/每次