BUG: Fix na_values dict not working on index column (#57547) #57965

tomhoq · 2024-03-22T14:51:57Z

closes BUG: na_values dict form not working on index column #57547
Tests added and passed
All code checks passed
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/v3.0.0.rst file if fixing a bug or adding a new feature.

In read_csv, pandas allows na_values to be passed as a dict, which lets you specify which values count as null for each column. When one of the column names is None, no null values are applied to that column and it is left as it was. This is the case exercised in issue #57547.
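For readers unfamiliar with the dict form, here is a minimal sketch of per-column na_values on ordinary (non-index) columns. The column names and the "missing" marker are made up for illustration and are not taken from the issue.

```python
import io

import pandas as pd

# Illustrative data: the string "missing" appears in both columns.
csv_text = "a,b\nmissing,1\nx,missing\n"

# Only column "a" treats "missing" as null; column "b" has no entry in the dict.
df = pd.read_csv(io.StringIO(csv_text), na_values={"a": ["missing"]})

print(df["a"].isna().tolist())
print(df["b"].tolist())
```

In this sketch the marker is only converted to NaN in column "a"; in column "b" it stays a plain string.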

The problem was that, under these conditions, the variables col_na_values and col_na_fvalues were not being set correctly, which caused a TypeError. All I had to do was define them as empty sets in an else block.

The Python engine did not implement this logic yet. I added it with an if statement that ensures na_values are only applied when the column name is not None.
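The guard described in the two paragraphs above can be sketched roughly as follows. This is a simplified, standalone illustration and not the pandas source: the helper name index_na_values and its signature are hypothetical, and it ignores details such as keep_default_na.

```python
def index_na_values(na_values, na_fvalues, col_name):
    """Hypothetical helper (not pandas code): pick NA markers for one index column."""
    if isinstance(na_values, dict):
        if col_name is not None:
            # Named column: look up its own markers. As a simplification,
            # columns absent from the dict get no markers.
            return set(na_values.get(col_name, ())), set(na_fvalues.get(col_name, ()))
        # Unnamed (None) column: use empty sets instead of leaving the
        # whole dict in place, so nothing in that column is treated as null.
        return set(), set()
    # Non-dict form: the same markers apply to every column.
    return set(na_values), set(na_fvalues)


# A None column name yields empty marker sets; a named column yields its own.
unnamed = index_na_values({"x": ["n/a"]}, {"x": []}, None)
named = index_na_values({"x": ["n/a"]}, {"x": []}, "x")
```

The point of the else-style fallback is that downstream code always receives sets, whatever the column name is.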

* fix base_parser not setting col_na_values when na_values is a dict containing None
* fix python_parser applying na_values in a column None
* add unit test to test_na_values.py
* update whatsnew

tomhoq · 2024-04-07T10:45:18Z

@rhshadrach Would it be possible to ask someone to review this PR? Thank you

mroeschke · 2024-04-09T17:08:40Z

Thanks @tomhoq

tomhoq · 2024-04-09T22:44:42Z

Thank you for the review

BUG: Na_values dict not working on index column (#57547)

06fb11e


mroeschke approved these changes Apr 9, 2024

mroeschke added the IO CSV read_csv, to_csv label Apr 9, 2024

mroeschke added this to the 3.0 milestone Apr 9, 2024

mroeschke merged commit 5376e2a into pandas-dev:main Apr 9, 2024
50 checks passed
