Miss-alignment in tpl indices and original file

Hey,

Can you point me to a simple example that uses `par_type="constant"` and `use_rows=[list-o-lines]` within pf.add_parameters() so I can break it?! A bit below, but I will spare anyone else the pain of looking at the rest of my mess!

I am using this to parameterize all lines in some files and it seems to work mostly as expected:
```
pf.add_parameters(filenames=fname, par_type="grid",
              par_name_base=pargp, pargp=pargp,
              use_cols=[2],
              upper_bound=par_df.loc[t,'upper_bound'],
              lower_bound=par_df.loc[t,'lower_bound'],
              ult_ubound=par_df.loc[t,'ult_ubound'],
              ult_lbound=par_df.loc[t,'ult_lbound'],
              transform = par_df.loc[t, 'partrans'],
              par_style=par_style, index_cols=0,
              initial_value=initial_value, 
              mfile_skip=1)
```
The file formerly known as fname (.csv):
```
ifno,kind,value,i,j
1,inflow,4000000.0,44,0
1109,inflow,34843.7,45,0
1110,inflow,8283.5355,42,0
1135,inflow,22508.228,47,6
1161,inflow,7076.489500000001,46,17
...
```
If I leave `mfile_skip=1` out it tries to parameterize the header (e.g., `parnme = pname:sfr.inflow_inst:0_ptype:gr_usecol:2_pstyle:m_idx0:ifno ` which is obviously wrong!

If I have `mfile_skip=1` in the call `ifno` is shifted by -1 from the original model input file (e.g. `parnme = pname:sfr.inflow_inst:1_ptype:gr_usecol:2_pstyle:m_idx0:0`). If I leave `mfile_skip` out of the call and use index labels rather than locations for use_cols and index_cols I get a similar result (ifno shifted -1). I don't understand this, but maybe that is okay, it ain't the first time. That produces the following .tpl file. 
```ptf ~
,sidx,idx_strs,pargp2,parval1_2,2
0,"(0,)",idx0:0,sfr.inflow,1.0,~       p0       ~
1,"(1108,)",idx0:1108,sfr.inflow,1.0,~       p1       ~
2,"(1109,)",idx0:1109,sfr.inflow,1.0,~       p2       ~
...
```

I do something similar for other files, but with a subset of lines. It seems the use_rows arg requires positional indexing, hence I use positional for use_cols and index_cols too and require the mfile_skip=1 to avoid parameterizing the header:

```
use_rows = df[df['cp_layer'] == lay].index.tolist() # [0,38,40,42,44...]
pf.add_parameters(filenames=fname, par_type="constant",                                       
                                  par_name_base=f"{pargp}_{lay}", pargp=pargp,
                                  use_cols=4,
                                  upper_bound=par_df.loc[t,'upper_bound'],
                                  lower_bound=par_df.loc[t,'lower_bound'],
                                  ult_ubound=par_df.loc[t,'ult_ubound'],
                                  ult_lbound=par_df.loc[t,'ult_lbound'],
                                  transform = par_df.loc[t, 'partrans'],
                                  par_style=par_style, index_cols=[0],
                                  initial_value=initial_value, use_rows=use_rows,
                                              mfile_skip=1)
```
That builds the following .tpl file, from the original file which is as expected.
```
ptf ~
,sidx,idx_strs,pargp4,parval1_4,4
0,"('123_0',)",idx0:123_0,cp_kh,1.0,~  pname:cp_kh_1_inst:0_ptype:cn_usecol:4_pstyle:m  ~
38,"('191_0',)",idx0:191_0,cp_kh,1.0,~  pname:cp_kh_1_inst:0_ptype:cn_usecol:4_pstyle:m  ~
40,"('191_2',)",idx0:191_2,cp_kh,1.0,~  pname:cp_kh_1_inst:0_ptype:cn_usecol:4_pstyle:m  ~
...

original file:
Name          x                                   y ...
123_0 1679938.8593249405 5413977.31979702 ...
123_1 1679938.8593249405 5413977.31979702 ...
180_0 1680281.4905325829 5401861.768442976...
180_1 1680281.4905325829 5401861.768442976 ...
...
```

Pest builds fine from pst_from with the above bits. But when I try to `pyemu.helpers.apply_list_and_array_pars(arr_par_file='mult2model_info.csv')` I get the following error related to the 2nd `add_parameters` call:

```
AssertionError: Probable miss-alignment in tpl indices and original file:
mult idx[:10] : [1230.0, 1910.0, 1912.0, 1914.0, 1916.0, 1918.0, 1920.0, 1922.0, 1924.0, 1926.0]
org file idx[:10]: ['123_0', '123_1', '180_0', '180_1', '180_10', '180_11', '180_12', '180_13', '180_14', '180_15']
n common: 0, n cols: 1, expected: 46.0.
```
From line 2213 in helpers.py
```
mlts
Out[4]: 
         Unnamed: 0         sidx     idx_strs pargp4  parval1_4    4
0                                                                   
1230.0            0   ('123_0',)   idx0:123_0  cp_kh        1.0  1.0
1910.0           38   ('191_0',)   idx0:191_0  cp_kh        1.0  1.0
1912.0           40   ('191_2',)   idx0:191_2  cp_kh        1.0  1.0
...

new_df
Out[5]: 
        oidx                   1                  2
0                                                  
123_0      0  1679938.8593249405   5413977.31979702 ...
123_1      1  1679938.8593249405   5413977.31979702 ...
180_0      2  1680281.4905325829  5401861.768442976 ...
180_1      3  1680281.4905325829  5401861.768442976 ...
 ...
```

So clearly the issue is in `common_idx = (new_df.index.intersection(mlts.index).drop_duplicates())` but I can't track how mlts gets its index or how I can adjust my pf builds to ensure it lines up with "new_df".

Any suggestions would be helpful. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Miss-alignment in tpl indices and original file #507

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Miss-alignment in tpl indices and original file #507

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions