CSVTool

Brief:

CSVTool is simple windows based application developed to perform various operation on CSV file, before its consumption by another application. It helps to rectify the error in CSV file in preliminary stage ensuring smooth running for another application consuming these files. 

Need:

We were required to do various operation on CSV file of larger magnitude on regular basis using SSIS package (SQL Server Integration Service). Depending on this process further operations were performed. Normally we used to get CSV file of around 600 MB.  If CSV files are not in proper format it hammers the execution process creating further problems. To avoid these kinds of situation, we have developed precautionary tool for validating and performing various manipulation operations on CSV file. To smoothen the further execution process and rectify the error in preliminary stage.

How it works:



Available Operations for CSV and TSV file

We have prepared windows application with tabular view for various operations like validate file, replace, file with issues, split file, join file and word count. 

Validate: This is the most important functionality in application, it validates the given CSV file with help of “Separator count” number of commas expected in single record in csv file. It displays total records, number of valid and invalid records and file status.    

Replace: This functionality is used to replace string or char in given csv file and new file is created in specified output location with replaced content.

Issues: This functionality is used to validate the CSV file using above validate functionality and also it creates two different files for valid and invalid records at specified location.

Split File: This functionality is used to break down massive csv file into smaller chunks by specifying the “Split count” number of records in a file. Making it more manageable for user.  

Join File: This functionality is used to club all small CSV chunk (created from above functionality) and joined them to create one single CSV file. One need to provide folder location where small CSV file is present.

Word Count: This functionality is used to get the occurrence of specified word in csv file. 

Available Operations for source type: Other



One has to specify the data separator for other source type and “.txt” is considered as default file extension. Remaining all functionality is same.  





Podcast

Michael Patterson sat down with the CEO of Boston Byte, Mustapha Shaikh to discuss the significance and rapid digitization of the healthcar...