Question

我不知道如何继续这一点，因为我是SQL的新手。我有两个巨大的表，每个表有大约70,000行，具有以下类型的数据：

表1：

A S D1
B S E2

表2：

A S D1
B S C
B S E2

我想得到以下内容：

import pandas as pd
import nltk
from nltk.tag import pos_tag
from nltk import word_tokenize
df = pd.DataFrame({'noun': ['good day', 'good night']})

编辑：我需要拥有表1和表2相交的所有数据。我还需要从表1中删除与第1列和第2列匹配但不在第3列匹配的数据。

Answer 1

我认为你需要两个问题：

    SELECT TABLE1.A,TABLE1.B,TABLE1.C 
from TABLE1 inner join TABLE2 on TABLE1.A = TABLE2.A and TABLE1.B = TABLE2.B  and TABLE1.C = TABLE2.C

- 用于交叉点

    SELECT TABLE1.A,TABLE1.B,TABLE1.C 
from TABLE1 inner join TABLE2 on TABLE1.A = TABLE2.A and TABLE1.B = TABLE2.B 
WHERE TABLE1.C <> TABLE2.C

- 丢弃物