Add ability to specify text encoding or disable transcoding #39

Hi,

It was reported [here](https://github.com/ofajardo/pyreadr/issues/64) in pyreadr that trying to open [this file](https://www.dropbox.com/s/650m9kxkb8dzglw/tip2020.rda.zip?dl=0) raises the following error:

```
Unable to convert string to the requested encoding (invalid byte sequence)
```

i.e. `RDATA_ERROR_CONVERT_BAD_STRING`.

Looking at the first 30 bytes of the file, I got the impression that it is encoded in CP1252 (maybe I am looking in completely the wrong place; I don't actually know how this file is structured):

```
RDX3\nX\n\x00\x00\x00\x03\x00\x03\x06\x01\x00\x03\x05\x00\x00\x00\x00\x06CP1252\x00
```
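To make my reading of those bytes concrete, here is a rough Python sketch of how I am interpreting them. It assumes the (already decompressed) stream starts with the `RDX3\n` magic and the `X\n` format marker, followed by four big-endian 32-bit integers (format version, writer R version, minimum reader version, length of the encoding name) and then the encoding name itself. The function name `read_rdx3_encoding` is made up for illustration, and this is not how librdata parses the file.

```python
import struct

MAGIC = b"RDX3\nX\n"  # RData magic plus the XDR format marker


def read_rdx3_encoding(data: bytes) -> str:
    """Return the encoding name declared in an RDX3 header (hypothetical helper)."""
    if not data.startswith(MAGIC):
        raise ValueError("not an uncompressed RDX3/XDR stream")
    offset = len(MAGIC)
    # Four big-endian int32 values: format version, writer version,
    # minimum reader version, and the byte length of the encoding name.
    version, _writer, _min_reader, enc_len = struct.unpack_from(">4i", data, offset)
    if version != 3:
        raise ValueError(f"unexpected serialization version {version}")
    offset += 16
    return data[offset:offset + enc_len].decode("ascii")


# The 30 bytes quoted above.
header = (
    b"RDX3\nX\n"
    b"\x00\x00\x00\x03\x00\x03\x06\x01\x00\x03\x05\x00"
    b"\x00\x00\x00\x06CP1252\x00"
)
print(read_rdx3_encoding(header))
```

If that interpretation is right, the file declares `CP1252` as its native encoding.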

Looking at the [source code](https://github.com/WizardMac/librdata/blob/master/src/rdata_read.c#L720), I was expecting to get `RDATA_ERROR_UNSUPPORTED_CHARSET` instead. Maybe librdata is not extracting the encoding correctly for this file?

And actually, would it be possible to support non-UTF-8 files?

Thanks!
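To illustrate what I mean, here is a small Python sketch (not librdata code) of the behaviour I would hope for. My assumption is that the error comes from CP1252 bytes being treated as UTF-8; I have not confirmed that this is what happens inside librdata. The helper `decode_r_string` and the sample bytes are made up for the example.

```python
def decode_r_string(raw: bytes, declared_encoding: str = "utf-8") -> str:
    """Decode raw string bytes using the encoding declared by the file (hypothetical helper)."""
    return raw.decode(declared_encoding)


# "café" as it would be stored in a CP1252 file: 0xE9 is "é" in CP1252,
# but on its own it is not a valid UTF-8 sequence.
sample = b"caf\xe9"

try:
    decode_r_string(sample)  # assumes UTF-8, which fails for this input
except UnicodeDecodeError as exc:
    print("UTF-8 decode failed:", exc.reason)

# Using the encoding declared in the header succeeds.
print(decode_r_string(sample, "cp1252"))
```

Either letting the caller pass the encoding explicitly, or returning the raw bytes untouched so the caller can decode them, would cover this case.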
