Skip to content

Latest commit

 

History

History
29 lines (22 loc) · 2.66 KB

TODO.md

File metadata and controls

29 lines (22 loc) · 2.66 KB

TODO

Bugs & refactorings

Future release

  • If not everything is connected to an destination when using predicates, it can be that the dataflow never finishes. Write some tests. See Github project DataflowEx for implementation how to create a predicate that always discards records not transferred.
  • Now the Connection Manager have a PrepareBulkInsert/CleanUpBulkInsert method. There are missing tests that check that make use of the Modify-Db settings and verify improved performance. DbDestination modifies these server side settings only once and at then end end of all batches.
  • VoidDestination: Use a NullBlock as Target
  • Check if SMOConnectionManager can be reinstalled again
  • ODBC connection managers for MySql etc.
  • add tests that support MariaDb (there is one test failing with CreatTableTask & ComputedColumn)

Update Docu

  • Improving Lookup with new set of attributes to define matching and retrieving properties. Also a new Aggretion component that simplifies creating aggregates (e.g. to calculate SUM, MIN, MAX or Count or any other custom defined calculation).
  • All text files source (Csv, Json, Xml) now accept either a file path OR an URL which is loaded with a HttpClient.
  • Excel source now skip blank lines

Enhancements

  • All sources (DbSource, CsvSource, etc. ) always read all the data from the source. For development purposes it would be benefical if only the first X rows are read from the source. A property public int Limit could be introduced, so that only the first X rows are read for a DBSource/CSVSource/JsonSource/. This is quite easy to implement as SqlTask already has the Limit property. For Csv/Json, there should be a counter on the lines within the stream reader...
  • CreateTableTask.CreateOrAlter(): add functionality to alter a table (with migration if there is data in the table).
  • CreateTableTask: Function for adding test data into table (depending on table definition)
  • New feature: Bounded Capacity for all Buffers (separately for every component & general static property in DataFlow), to restrict buffer size and max memory consumption

Todo

  • PrimaryKeyConstrainName now is part of TableDefinition, but not read from "GetTableDefinitionFrom"
  • GCPressure was detected on CSVSource - verify if CSVSource really is the root cause. (See performance tests, improve tests that uses memory as source)
  • in order to have these tests fully working, add something like MaxBufferSize as DataFlow parameter for all DataFlowTasks and use this when creating DF components - also have a static DefaultMaxBufferSize as Fallback value