当我在Linux上运行此代码时,为什么这些代码始终给我一个Segfault错误?

时间:2019-04-07 15:48:48

标签: c

因此,我试图创建一个接收文本文件的函数,该文件包含一串用换行符分隔的单词,然后将文本文件读取到char**数组中。

当我在Windows上的netbeans中运行此代码时,它可以正常工作,但如果在Linux中运行它,则会出现分段错误错误。

// globals
FILE *words_file;
char **dic;
int num_words = 0;

void read_to_array() {
    words_file = fopen("words.txt", "r");
    char *line = NULL;
    int i = 0;
    size_t len = 0;
    dic = (char **)malloc(99999 * sizeof(char *));

    // read dic to array
    while (getline(&line, &len, words_file) != -1) {
        dic[i] = (char*)malloc(len);
        strcpy(dic[i], line);

        // get rid of \n after word
        if (dic[i][strlen(dic[i]) - 1] == '\n') {
            dic[i][strlen(dic[i]) - 1] = '\0';
        }
        ++i; 
        num_words++;
    }
    //printf("%s", dic[i][strlen(dic[i]) - 1]);  //testing 
    fclose(words_file);
    dic[i] = NULL;
}

我在这里想念什么?

1 个答案:

答案 0 :(得分:1)

程序中存在一些问题,可能会导致您观察到未定义的行为:

  • 您不会测试文件是否成功打开,如果文件不在您期望的位置或名称不同,则会导致未定义的行为。
  • 您不限制读入数组的行数,如果文件包含多于99998行,则会导致未定义行为,在Linux中可能是这种情况,因为/usr/share/dict/words在我的系统上有139716行,对于例子。

您的内存分配方案不是最优的,但是正确的:您应该计算单词的长度,并在分配副本之前 去除换行符。按照编码,您分配了太多的内存。但是,在从line返回之前,您应该释放read_to_array,并且应该避免使用全局变量。

这是修改后的版本:

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

char **read_to_array(const char *filename, int *countp) {
    FILE *words_file;
    char *line = NULL;
    size_t line_size = 0;
    char **dic = NULL;
    int dic_size = 0;
    int i = 0;

    words_file = fopen(filename, "r");
    if (words_file == NULL) {
        fprintf(stderr, "cannot open dictionary file %s\n", filename);
        return NULL;
    }

    dic_size = 99999;
    dic = malloc(dic_size * sizeof(char *));
    if (dic == NULL) {
        fprintf(stderr, "cannot allocate dictionary array\n");
        fclose(words_file);
        return NULL;
    }

    // read dic to array
    while (getline(&line, &line_size, words_file) != -1) {
        size_t len = strlen(line);
        /* strip the newline if any */
        if (len > 0 && line[len - 1] == '\n') {
            line[--len] = '\0';
        }
        if (i >= dic_size - 1) {
            /* too many lines: should reallocate the dictionary */
            fprintf(stderr, "too many lines\n");
            break;
        }
        dic[i] = malloc(len + 1);
        if (dic[i] == NULL) {
            /* out of memory: report the error */
            fprintf(stderr, "cannot allocate memory for line %d\n", i);
            break;
        }
        strcpy(dic[i], line);
        i++;
    }
    dic[i] = NULL;
    *countp = i;
    fclose(words_file);
    free(line);
    return dic;
}

int main(int argc, char **argv) {
    const char *filename = (argc > 1) ? argv[1] : "words.txt";
    int num_words;
    char **dic = read_to_array(filename, &num_words);
    if (dic != NULL) {
        printf("dictionary loaded: %d lines\n", num_words);
        while (num_words > 0)
            free(dic[--num_words]);
        free(dic);
    }
    return 0;
}

输出:

chqrlie> readdic /usr/share/dict/words
too many lines
dictionary loaded: 99998 lines