Question

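I merged two datasets in Stata, and now I need to find the number and the proportion of matched items. I assume I will need to compute two counts to do this.

How can I display both counts together and then divide one by the other?

Here is a sample of my _merge variable: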

4022. | master only (1) |
4023. |     matched (3) |
4024. |  using only (2) |
4025. |  using only (2) |
4026. |  using only (2) |
4027. |     matched (3) |
4028. |     matched (3) |
4029. |     matched (3) |
4030. |     matched (3) |

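First I want to count and store all observations under _merge, then count those that do not say "master only", and then divide the second count by the first.

For example: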

count1    count2    fraction
6019      4020      .66 (4020/6019)

count1 is everything under _merge, and count2 is everything that matched (that is, everything except master only).

Answer 1

Consider the following toy example:

clear

webuse autosize
merge 1:1 make using http://www.stata-press.com/data/r14/autoexpense

First, it is a good idea to confirm which numeric value corresponds to "master only":

list _merge

     +-----------------+
     |          _merge |
     |-----------------|
  1. |     matched (3) |
  2. |     matched (3) |
  3. |     matched (3) |
  4. | master only (1) |
  5. |     matched (3) |
     |-----------------|
  6. |     matched (3) |
     +-----------------+

list _merge, nolabel

     +--------+
     | _merge |
     |--------|
  1. |      3 |
  2. |      3 |
  3. |      3 |
  4. |      1 |
  5. |      3 |
     |--------|
  6. |      3 |
     +--------+
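
As a quicker check, the label mapping and a frequency table can be inspected directly. This is only a sketch: it assumes the value label attached by merge carries the default name _merge, and it repeats the toy merge so that it runs on its own.

```stata
clear
webuse autosize
merge 1:1 make using http://www.stata-press.com/data/r14/autoexpense

* Show how the numeric codes map to the labels (assumes the label is named _merge)
label list _merge

* Frequency table of the merge results, with percentages
tabulate _merge
```

The tabulate output already reports the count and percentage for each merge category, which may be enough if the numbers are only needed for inspection.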

Then count the relevant observations and divide, generating three variables:

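* Count all observations
count
generate count1 = r(N)

* Count observations that are not "master only" (_merge != 1)
count if _merge != 1
generate count2 = r(N)

* Share of observations that are not "master only"
generate fraction = count2 / count1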

display count1
6

display count2
5

display fraction
.83333331
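
If the counts are only needed for display, one possible variation is to keep them in scalars instead of generating variables that hold the same value in every observation. The sketch below repeats the toy merge so that it runs on its own; the scalar names n_all, n_not_master and frac are illustrative and not part of the original answer.

```stata
clear
webuse autosize
merge 1:1 make using http://www.stata-press.com/data/r14/autoexpense

* Count all observations
count
scalar n_all = r(N)

* Count observations that are not "master only" (_merge != 1)
count if _merge != 1
scalar n_not_master = r(N)

* Ratio of the second count to the first
scalar frac = n_not_master / n_all

* Show both counts and the fraction on a single line
display "count1 = " n_all "   count2 = " n_not_master "   fraction = " %5.3f frac
```

This prints both counts next to each other, as asked in the question, and leaves the dataset itself unchanged.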
