Incorrect data:
Most of big data is unstructured and unstructured data is difficult to make sense of. It is very likely the system will not be complex enough to understand it fully. It may be interpreted wrongly resulting in nonsensical output or it may contain incorrectly spelled words or grammatically incorrect sentences which we cannot use easily.
Incorrect/missing out data:
Another problem is that one person out of the data set may have recorded information incorrectly and so it is not a true data set therefore making analysis of the incorrect data meaningless for real life. Data produced from people can be biased, this leads to inaccurate predictions. People may have been paid to give out certain information which can be incorrect e.g. sponsored content is often ingenuine and this can reduce the effectiveness of the predictions as we can't tell if the opinions were true or not.
Correct data:
No comments:
Post a Comment