neo4j所有朋友的财产平均值

时间:2017-11-29 15:28:04

标签: graph neo4j cypher

有一个图表:

enter image description here

CREATE (Alice:Person {id:'a', fraud:1})
    CREATE (Bob:Person {id:'b', fraud:0})
    CREATE (Charlie:Person {id:'c', fraud:0})
    CREATE (David:Person {id:'d', fraud:0})
    CREATE (Esther:Person {id:'e', fraud:0})
    CREATE (Fanny:Person {id:'f', fraud:0})
    CREATE (Gabby:Person {id:'g', fraud:0})
    CREATE (Fraudster:Person {id:'h', fraud:1})


CREATE
  (Alice)-[:CALL]->(Bob),
  (Bob)-[:SMS]->(Charlie),
  (Charlie)-[:SMS]->(Bob),
  (Fanny)-[:SMS]->(Charlie),
  (Esther)-[:SMS]->(Fanny),
  (Esther)-[:CALL]->(David),
  (David)-[:CALL]->(Alice),
  (David)-[:SMS]->(Esther),
  (Alice)-[:CALL]->(Esther),
  (Alice)-[:CALL]->(Fanny),
  (Fanny)-[:CALL]->(Fraudster)

尝试查询时:

MATCH (a)-->(b)
WHERE b.fraud = 1
RETURN (count() / ( MATCH (a) -->(b) RETURN count() ) * 100)

我想计算用户的欺诈行为(因为欺诈只有0或1被定义为所有连接节点欺诈级别的平均值:

MATCH ()--(f)
RETURN f.id, f.fraud, COUNT(*), COLLECT(f) AS fs

返回正确数量的朋友,但无法访问这些朋友,即在collect语句中只访问节点本身:

╒══════╤═════════╤══════════════╤══════════╤══════════════════════════════════════════════════════════════════════╕
│"f.id"│"f.fraud"│"avg(f.fraud)"│"COUNT(*)"│"fs"                                                                  │
╞══════╪═════════╪══════════════╪══════════╪══════════════════════════════════════════════════════════════════════╡
│"h"   │1        │1             │1         │[{"fraud":1,"id":"h"}]                                                │
├──────┼─────────┼──────────────┼──────────┼──────────────────────────────────────────────────────────────────────┤
│"f"   │0        │0             │4         │[{"fraud":0,"id":"f"},{"fraud":0,"id":"f"},{"fraud":0,"id":"f"},{"frau│
│      │         │              │          │d":0,"id":"f"}]                                                       │
....

即。天真地计算平均值

MATCH ()--(f)
RETURN f.id, avg(f.fraud)

只考虑这个单一节点,而不考虑网络。 如何考虑节点的社交网络(最多定义深度,即此处为1)以改善neo4j percentage of attribute for social network的原始答案

修改

MATCH p = ()--()
UNWIND nodes(p) AS f
RETURN f.id, f.fraud, COUNT(*), COLLECT({id: f.id, fraud: f.fraud}) AS fs

将仅返回列表中原始节点的副本,而不是连接的节点:

│"f.id"│"f.fraud"│"COUNT(*)"│"fs"                                                                  │
╞══════╪═════════╪══════════╪══════════════════════════════════════════════════════════════════════╡
│"h"   │1        │2         │[{"id":"h","fraud":1},{"id":"h","fraud":1}]                           │
├──────┼─────────┼──────────┼──────────────────────────────────────────────────────────────────────┤
│"f"   │0        │8         │[{"id":"f","fraud":0},{"id":"f","fraud":0},{"id":"f","fraud":0},{"id":│
│      │         │          │"f","fraud":0},{"id":"f","fraud":0},{"id":"f","fraud":0},{"id":"f","fr│
│      │         │          │aud":0},{"id":"f","fraud":0}]                                         │

编辑2

MATCH p = (source)--(destination)
RETURN source.id, source.fraud, COUNT(*), COLLECT({id: destination.id, fraud: destination.fraud}) AS neighbors

已经非常接近 - 但缺少avg功能

0 个答案:

没有答案