SFTP Monitoring
Objective
To demonstrate the capability of the FTPGet component to monitor files in a Source location on an SFTP server using Private Key authentication and the Highest Modification Time filter condition.
Prerequisites
- Start Fiorano Enterprise Server (FES) and Fiorano Peer Server (FPS)
- Start SFTP server. If SFTP server is not installed, refer the Installing SFTP Server section below.
- Login to eStudio application
- Understand the basic menus and perspectives (panels) in eStudio application
Scenario
Using FTPGet Component, download files whose last modification time is greater than the highest modification time found in the last polling cycle to the local destination from SSH-protected FTP server.
Components used
Setting up the Example
Installing SFTP Server
Install the SFTP server in Linux following the steps explained below (please ignore if the installation is already done):
- Install vsftpd server by executing the following command in the Command prompt:
yum install vsftpd
- Generate key pairs (both RSA and DSA) and store in ~/.ssh/ directory. For key generation, we can use the following commands for RSA type and DSA type key generations respectively:
ssh-keygen -t rsa
ssh-keygen -t dsa
- While executing the above commands, the file name and password for the file have to be provided. If the file name provided is id_rsa, then two files will be generated named id_rsa.pub and id_rsa which are public key and private key respectively.
- Install the public keys in the server. The private key has to be with the client which is used to connect to the server. The SFTP server will authenticate the client's private key using its public key. This type of client authentication is called Private Key Authentication. The installation of the public key in the server can be done by executing the following command:
ssh-copy-id -i ~/.ssh/id_rsa.pub root@localhost
where "~/.ssh/id_rsa.pub" is the RSA public key file path and "root@localhost" is the server (here we are using the same machine). - Change the following in the sshd_config file located at etc/ssh/ folder:
- Set RSAAuthentication or DSAAuthentication to 'yes' based on the key file type used.
- Add the line "IdentityFile ~/.ssh/id_rsa" in the ssh_config file located at /etc/ssh (if already Identity File is set to some other file, modify it).
- Restart the servers using the following commands:
/etc/rc.d/init.d/sshd restart
/etc/rc.d/init.d/vsftpd restart
After performing the above actions, the server will be ready to accept the SFTP protocol from clients to connect to it. Server settings in the FTP component help to connect to the server and perform Put/Get operations on the server.
Configuring FTPGet component
The FTPGet component helps to download the file from the FTP Server. To configure FTPGet component, perform the following actions:
- Add a new Event Process (EP) with the name 'SFTP_Monitoring_Eg'.
- Drag the FTPGet component from the Bridges category in the Micro Service Palette to the Fiorano Orchestrator.
Figure 1: Adding FTPGet component in SFTP_Monitoring_eg EP
- Double-click the FTPGet component to open its Configuration Property Sheet (CPS).
Figure 2: FTPGet CPS
SFTP Protocol Configuration
To enable Secured File Transfer, the SFTP protocol needs to be set in the connection configuration. To configure the STP protocol, perform the following actions:
- In theManaged Connection Factory window, click Connection Properties tab to expand the options and click the Connection Configuration ellipsis button.
Figure 3: Navigating to do Connection Configuration
- In FtpConnectionConfig dialog box, select 'SFTP' from the Protocol drop-down under Component Configuration section and provide the following values menu to load the SFTP server configuration settings:
- Remote host: The IP address of the machine where the SFTP server is running. 'localhost' may be used if the server is installed on the same machine.
- Port: 22, the port number on which the FTP server is running.
- Login: Username to be used to login to the SFTP server. In the example, it is root, the user under which the server had been installed in the machine.
- Client Authentication Type: Select 'Private Key' from the drop-down to login to the SFTP server using the client's private key. Three fields needed for private key authentication appear: Private Key Type, Private Key file, and Private Key Password.
- Private Key: Provide the path of the private key (id_rsa) generated above in the Private key field. Select the Private key type as RSA/DSA depending on the generated key. Enter the Private key password also to provide access to the key by the server.
Figure 4: SFTP protocol configuration
- Click OK to save the server configuration.
- After saving the connection configuration details, click the Test button in Managed Connection Factory page. Test Connection Configuration pop-up shows the connection status.
Figure 5: Testing SFTP Protocol connection
Monitor Directory Configuration
To make the FTPGet adapter poll a directory on the server for files matching a particular pattern and download all such files from the server. Enabling this property makes the FTPGet adapter poll the Source directory using the polling configuration specified in Scheduler Configurations panel. After ensuring that the Source directory exists on the server, perform the following actions:
- In the Managed Connection Factory page, click Next to move to the Interaction Configurations Page.
Figure 6: Interaction Configurations panel in FTPGet CPS
- Select the Monitor Directory checkbox to enable Monitoring on the SFTP server. Three options appear as below:
- Monitor Directory Configuration
- Monitor Time Configuration
- Use File not FoundException in Monitoring
Figure 7: Selecting Monitor Directory option
- Click the Monitor Directory Configuration Ellipsis button to open the FtpMonitoringConfiguration window.
Figure 8: Navigating to Monitor Directory Configuration
- In the FtpMonitoringConfiguration window, provide the following details:
- Enter the folder path on the server where the files need to be monitored. Enter paths for Working directory, Processed directory and Error directory also and all of these paths should be different from each other.
- Select Move to working directory checkbox, if the files which are processed need to be moved to the Processed directory after the transfer is complete. Also, files will move to the error directory in case of any unexpected error during the transfer. If the checkbox is cleared, then files will remain in the source folder even after the processing is complete.
- Provide value for File name patterns to filter specific file types that need to be monitored in the source folder. In this example, it is "*.csv", which picks all the .csv files in the source folder for processing.
Figure 9: Configuring Directory details
- Click OK to save the Monitoring Directory configuration.
Monitor Time Configuration
Different Time filters help to filter the files while monitoring the source folder based on their last modified time.
To configure the component with the Highest Modification Type filter type as in the example scenario, perform the following actions:
- Under Monitoring Settings section in the Interaction Configurations panel, click the Monitor Time Configuration ellipsis button.
Figure 10: Navigating to Monitor Time Configuration
In the FtpMonitorTimeConfiguration dialog box, select Time-based file filtering type as 'HIGHEST_MODIFICATION_TIME' and then click OK.
To understand the functionality of each Time filter type, refer to the FTPGet section.
Figure 11: Selecting HIGHEST_MODIFICATION_TIME filter type
- For HIGHEST_MODIFICATION_TIME filter type, two other options namely Base Time and Remote host time offset appear.
Figure 12: Configuring Highest Modification Time filter type details- Base Time: 23:11:2012 15:25 During monitoring, the Base Time value will be used only in the first poll for this filter type. Only the files with the last modified time greater than the base time will be picked in the first poll from the source folder.
- Remote host time offset: -03:30 This will be the time difference between the SFTP server time and the client machine local time.
Selecting Use File not Found Exception in Monitoring checkbox under Monitor Settings section enables the FTPGet component to throw an exception when no files are found in the monitored folder matching the file pattern name and time-filter types selected.
Target Directory Configuration
In the Interaction Configurations panel, perform the following actions:
- In the Parent directory path on the local system text field under Target Settings section, enter the Parent Directory path relative to which a relative path of the Target directory would be computed. Provide it as D:\dev\ftp
Figure 13: Providing parent directory
- Click the Target Directory Configuration Ellipsis button to configure target directory.
- In the TargetConfig dialog box, make the following changes, and click OK to save the Target Directory Configuration:
- Target Directory: Enter the path relative to the parent directory provided in the previous step. Files that are processed while monitoring the source folder will be saved to this folder after the transfer is completed. Provide it as 'receivedFiles'.
- Select Use Temporary target directory checkbox to ensure that only completely transferred files will be saved in the target directory. The temporary target directory will be named 'temp' by default.
- Select the appropriate option (Overwrite/Rename/Append) from the Action if same file exists drop-down to define the action when the file being transferred matches the file name of an already existing file. Select OVERWRITE.
Figure 14: Providing Target Directory details
- Under Target Settings section, select Auto create Target Directory checkbox to automatically create the target directory at the given path if it is not already present on the system.
Monitor Scheduler Configuration
To configure scheduling information, continue FTPGet configuration as follows:
- In the Interaction Configurations panel, click Next to navigate to the Scheduler Configurations panel.
Figure 15: Providing Scheduler Configurations
- Under Scheduler Configurations tab, set Repeat Interval text field as 5 setting the time gap required between each poll while monitoring the source folder for any matching files.
To set how many times the poll should run on the folder before stopping the poll, clear the Repeat forever checkbox and provide a value in the Repeat Only 'n' Times field. In this case, retain it as it is.
Advanced scheduling options can be found in the Advanced Configuration tab. Daily/Weekly/Monthly polls with specific Start Time and End Time Sessions is possible. - Click Finish to save the Component configuration.
After the above steps are completed, the FTPGet component gets configured for monitoring the files in the source folder on the SFTP server.
Similarly, you can configure the FTPPut component to do Put operations on the SFTP server by following similar steps as above with some minor changes such as the Source folder being on the local system and the Target Directory on the SFTP server (as the Put operation is just opposite to that of Get).
Composing the Event Process
To add a Display component to display the output response from the FTPGet component after each file transfer gets completed in the SFTP_Monitoring_Eg event process, perform the following actions:
- Drag the Display component also to the orchestrator editor from Micro Service Palette under Util category.
- Connect the output port of FTPGet component to input port of Display component.
Figure 16: Connecting FTPGet component to Display
Running the Example
Ensure that the SFTP Server is running and perform the following actions:
- Click Check Resource and Connectivity icon (or press ALT+SHIFT+C) to check resource and connectivity.
- Click Run Event Process icon (ALT+SHIFT+R) to run the event process. Display window gets opened.
- Add a file 'Newfile.csv' with last modified time as '12 March 2014 08:09:25 PM' in the source folder after the second poll.
- Display window gets the message when the file transfer happens.
Figure 17: Message appearing in Display after the File Transfer
How the application works is explained in the below section.
How it Works
For SFTP Monitoring with Highest-Modification Time-filter type enabled, the event process goes through the following steps during the processing:
- The source folder configured in the FTP component contains the following files with the corresponding time of modification:
- CheckHead_01232014.csv: Tue 11 Mar 2014 11:00:20 PM IST
- CheckItems_01232014.csv: Wed 12 Mar 2014 07:52:55 PM IST
- Data.txt: Wed 12 Mar 2014 07:39:16 PM IST
- Hello.txt: Thu 23 Jan 2014 05:48:34 PM IST
- Number.txt: Wed 12 Mar 2014 07:38:57 PM IST
Figure 18: Contents in the Source Folder
- In the first poll, after launching the event process (12 March 2014 08:02:15 PM in this example), while monitoring the folder, only the files 'CheckHead_01232014.csv' and 'CheckItems_01232014.csv' will be picked for processing, as they match both the pattern name condition (.csv file type) as well as the HIGHEST MODIFICATION TIME filter condition for the first poll. Both the files have a last modification time greater than the Base time provided in the Monitor Time Configuration.
- After the first poll is completed and all the matching file transfers are finished, the component calculates the Highest Modification Time among all the processed files and saves it for the next poll. In this case, it will be '12 March 2014 07:52:55 PM'.
- In the Second poll (on 12 March 2014 08:07:15 PM as per Repeat Interval - 5 minutes), none of the files will be processed, as all the files have a last modification time less than or equal to the Highest Modification Time calculated in the previous poll.
- After the Second poll, add a file 'new.csv' with the last modified time as '12 March 2014 08:09:25 PM' in the source folder. (please do this manually)
- In the Third poll, the Highest Modification Time value will still be '12 March 2014 07:52:55 PM' as it did not process any new files in the previous poll. Now the file named Newfile.csv will be processed in the third poll (at 08:12:15 PM) as its last modification time is greater than the Highest Modification Time value and matches the file pattern name. The FTPGet Output response appears in the Display window (as provided in the "Running the Example" section) after the transfer is completed.
- After the transfer of Newfile.csv file is completed, the Highest Modification Time value will be set to the last modified time of the Newfile.csv file as it is the only file processed during the third poll and the component will continue its polling as per the Scheduler Configuration.
If the same example is modified by using NONE Time-filter type, all the files with .csv type will be moved to the Working Directory and subsequently moved to Processed Directory as the transfer gets completed.
Change the time using the above logic.
Reference
- Download SFTP_Monitoring_Eg event process and import it to the Event Process Repository in the Server Explorer in eStudio to understand the configuration used in this example and to execute the working sample.
- Download the Private Key File used in this example, save it in a particular location and provide it in the (See Figure 4).
- Please note that the component options used in this example are minimal, which helps you to get an overview of the application. To explore the other options present in the FTPGet and Display components, refer to the FTPGet and Display sections.