Consider only exposing necessary properties from the parser and serialiser interfaces.

christophano commented 8 years ago

As we discussed on gitter - could we consider a way to only require those properties that are used by the CsvReader/CsvWriter internals as part of the ICsvParser/ICsvSerializer interfaces?

Example from version 2.x - ICsvParser requires you to implement the properties CharPosition and BytePosition. These aren't valid for my ExcelParser, so I simply return -1 without any issues. However, v3 has introduced the TextReader property, which is less simple to work around. Returning null or throwing a NotImplementedException are possibilities, but I'd rather we could avoid that.

My suggestion would be to continue exposing properties from the individual parsers and make them accessible to the api user by exposing them via a type argument in the CsvReader/CsvWriter classes.

For example:

public class CsvReader<TParser> where TParser : ICsvParser
{
    public CsvReader(TParser parser)
    {
        Parser = parser;
    }
    public TParser Parser { get; }
    // rest of the implementation
}

which can be used like this:

using (var reader = new CsvReader<ExcelParser>(new ExcelParser("path/to/workbook")))
{
    // access a property specific to ExcelParser
    var workbook = reader.Parser.Workbook;
}

and for ease of use from the core library:

public class CsvReader : CsvReader<CsvParser>
{
    public CsvReader(CsvParser parser) : base(parser) { }
}

which will continue to be used as it is now:

using (var reader = new CsvReader(new CsvParser(File.Open("path\to\file"))))
{
    // access property specific to CsvParser
    var textReader = reader.Parser.TextReader;
}

What are your thoughts?

christophano commented 7 years ago

Sorry, I haven't updated this issue. Here's an example: ICsvParser is used (primarily) by the CsvReader class. The only members CsvReader needs ICsvParser to have is int Row { get; } and string[] Read();; which leaves TextReader TextReader { get; }, ICsvParserConfiguration Configuration { get; }, long CharPosition { get; }, long BytePosition { get; }, int RawRow { get; } and string RawRecord { get; } as specific to the implementation. Perhaps they could be left to the implementation, or moved to some sort of intermediate interface?

Something like;

public interface IParser : IDisposable
{
    int Row { get; }

    string[] Read();
}

public interface ICsvParser : IParser
{
    TextReader TextReader { get; }

    ICsvParserConfiguration Configuration { get; }

    long CharPosition { get; }

    long BytePosition { get; }

    int RawRow { get; }

    string RawRecord { get; }
}

then I could have;

public interface IExcelParser : Parser
{
    XLWorkbook Workbook { get; }

    int FieldCount { get; }
}

which would allow us to each expose the properties we want, while not having to implement any that are unnecessary.

Obviously this would be quite a bit of work, but I'm happy to start it on a branch, if anyone agrees it is worthwhile.