我有一个非常小的问题让我困惑了一段时间。我有一个有趣功能的数据集,但其中一些是无量纲的数量(我已尝试过使用z分数),但它们使事情变得更糟。这些是:
Timestamps (Like YYYYMMDDHHMMSSMis) I am getting the last 9 chars from this.
User IDs (Like in a Hash form) How do I extract meaning from them?
IP Addresses (You know what those are). I only extract the first 3 chars.
City (Has an ID like 1,15,72) How do I extract meaning from this?
Region (Same as city) Should I extract meaning from this or just leave it?
剩下的事情是价格,宽度和高度。任何帮助或见解将不胜感激。谢谢。
答案 0 :(得分:1)