I am trying out TFX pipelines, and the first step is to ingest data. In this simple example I am ingesting data from a local (google drive) csv file with CsvExampleGen, following the book "Building Machine Learning Pipelines, publish by O’reilly. However, it has proven difficult to simply import the csv-file. I don’t get an error, but the artifact is empty.
Here is the code snipped:
from tfx.components import CsvExampleGen os.chdir("/content/gdrive/MyDrive/TFXnotebooks") base_dir = os.getcwd() data_dir = os.path.join(base_dir, "data/") context = InteractiveContext() example_gen = CsvExampleGen(input_base= data_dir) context.run(example_gen)
I’ve checked with
os.listdir(data_dir) that the file is indeed there.
Note: I know in many sources it says to import
tfx.utils.dsl_utils, but this method is not supported any longer (the module doesn’t exist anymore).
Hope someone can help,