~ihris+cssc/ihris-manage/data-import-4.0 : contents of README.txt at revision 99

~ihris+cssc/ihris-manage/data-import-4.0 : (revision 99)
The script 
    import_tools/module_creator.php
is used to create the modules for standradized data which
resides .csv files in:
    import_tools/standrad_data
the name of the .csv file (without the .csv) is the name
of the form $form that a module should be created for. the first 
row of each .csv file is the name of the field in the form
that column corresponds to.  The modules are created as:
     modules/data-$form/ihris-manage-data-$form.xml
It extracts the district list from the employee list as provided from
IMA.  This employee list is expected to live in:
      import_tools/sensitive_data/emplyees.csv


Note:  eveyrthing in import_tools/sensitive_data should be 'bzr ignore'd.





================OLD README  NO LONGER RELEVANT==============
There are two scripts that can be run.  Both scripts use the 
"base data" included in the 'data/' subdirectory. This defines 
things like Employers, Payers, Georaphic data, etc.

import_facility.php
 Expects as one command line argument the name of a CSV file
 containing facility data.  
 As the 'CSSC Code' is unique per facilility it can
 be run multiple times on the same facility data file and
 will not produce duplicate facilities. 
 The expected columns of the facility data are:
    cssc_code
    type
    reg_no
    facility
    zone
    region
    district
    contact
    phone
    alt_phone 
    email
    address
 although the last two columns are not used. 
 It produces a file:
   facilities.incomplete.DATE.csv
 which contain a list of the facilities with incomplete data.
 The columns of this file are the same as the input file with an
 additional column containing a brief description as why this
 facility data was considered incomplete.



import_employee.php
 Expects as one command line argument the name of a CSV file
 containing employee data.   As there is no unique identifier
 for an employee, if it is rerun on the same file, it will
 produce DUPLICATE records.
 The expected columns of the employee data are:
      rev_date
      nbr
      cssc_code
      id_num
      zone
      name
      sex
      birth_year
      job_unrevised
      employer_short
      degree_short
      qual_year
      years_at_facility
      payer_short
      year_last_promotion
      last_training
      training_topic
      salary_monthly
      salary_comments
      zone
      rec_change
      job_short
      region
      district
 Training data is ignored.  The job_unrevised column is ignored in
 favor of the job_short column which contains the job code.  It attempts
 create a new position for each employee.  It outputs four files:
    employees.bad.DATE.csv
	The employess whose records could not be added to the
        system. For example, they were missing a name.
    employees.noposition.DATE.csv
        The employees who could be added to the system, but who do not
        have a position (for example because of a bad job code)
    employees.incomplete.DATE.csv
        The employees who were added to the system with a position,
        but some of the provided information is incomplete or inaccurate
    employees.good.DATE.csv
        The employees who were succesfully added to the system.
 The columns of all these files are the same as the input file, with
 the addition of an extra column indicating the problem.