有效地从JSON文件中删除无效字符?

时间:2018-08-20 13:29:26

标签: json go

我正在通过命令行读取文件。

由于该文件是Oracle的JSON导出,因此具有一定的结构。由于某种原因,此默认结构不是有效的JSON。示例:

// This isn't valid JSON
,"items":
[
{"id":123,"language":"ja-JP","location":"Osaka"}
,{"id":33,"language":"ja-JP","location":"Tokyo"}
,{"id":22,"language":"ja-JP","location":"Kentok"}
]}

我希望它只是一个对象数组,从而具有预期的输出:

// This is valid json
[
{"id":123,"language":"ja-JP","location":"Osaka"}
,{"id":33,"language":"ja-JP","location":"Tokyo"}
,{"id":22,"language":"ja-JP","location":"Kentok"}
]

因此,我需要从文件的最后一行完全删除第1行以及最后}

正在通过命令行从输入中解析文件:

file, err := ioutil.ReadFile(os.Args[1])

我正在尝试以这种方式删除无效的字符串/单词,但不会重新格式化任何内容:

// in func main()
removeInvalidJSON(file, os.Args[1])

// later on .. 
func removeInvalidJSON(file []byte, path string) {

    info, _ := os.Stat(path)
    mode := info.Mode()

    array := strings.Split(string(file), "\n")
    fmt.Println(array)

    //If we have the clunky items array which is invalid JSON, remove the first line
    if strings.Contains(array[0], "items") {
        fmt.Println("Removing items")
        array = append(array[:1], array[1+1:]...)
    }

    // Finds the last index of the array
    lastIndex := array[len(array)-1]

    // If we have the "}" in the last line, remove it as this is invalid JSON
    if strings.Contains(lastIndex, "}") {
        fmt.Println("Removing }")
        strings.Trim(lastIndex, "}")
    }

    // Nothing changed?
    fmt.Println(array)

    ioutil.WriteFile(path, []byte(strings.Join(array, "\n")), mode)
}

上面的函数确实写入了我所看到的文件-但据我所知,它不会更改数组,也不会将其写入文件中。

如何有效地将文件的第一行以及文件中的最后一个假花括号}移开?

我在另一个函数中将JSON解组:是否有使用"encoding/json"库更“干净”地执行JSON的方法?

1 个答案:

答案 0 :(得分:1)

此代码有几个重大问题,导致其行为不正常。我已经在下面的评论中提到了这些:

func removeInvalidJSON(file []byte, path string) {

    info, _ := os.Stat(path)
    mode := info.Mode()

    array := strings.Split(string(file), "\n")
    fmt.Println(array)

    //If we have the clunky items array which is invalid JSON, remove the first line
    if strings.Contains(array[0], "items") {
        fmt.Println("Removing items")
        // If you just want to remove the first item, this should be array = array[1:].
        // As written, this appends the rest of the array to the first item, i.e. nothing.
        array = append(array[:1], array[1+1:]...)
    }

    // Finds the last ~index~ *line* of the array
    lastIndex := array[len(array)-1]

    // If we have the "}" in the last line, remove it as this is invalid JSON
    if strings.Contains(lastIndex, "}") {
        fmt.Println("Removing }")
        // Strings are immutable. `strings.Trim` does nothing if you discard the return value
        strings.Trim(lastIndex, "}")
        // After the trim, if you want this to have any effect, you need to put it back in `array`.
    }

    // Nothing changed?
    fmt.Println(array)

    ioutil.WriteFile(path, []byte(strings.Join(array, "\n")), mode)
}

我认为您想要的更像是

func removeInvalidJSON(file []byte, path string) {
    info, _ := os.Stat(path)
    mode := info.Mode()

    array := strings.Split(string(file), "\n")
    fmt.Println(array)

    //If we have the clunky items array which is invalid JSON, remove the first line
    if strings.Contains(array[0], "items") {
        fmt.Println("Removing items")
        array = array[1:]
    }

    // Finds the last line of the array
    lastLine := array[len(array)-1]

    array[len(array)-1] = strings.Trim(lastLine, "}")

    fmt.Println(array)

    ioutil.WriteFile(path, []byte(strings.Join(array, "\n")), mode)
}