Q1: Define enterprise data warehousing.
A1: When an organization’s data is consolidated and made available through a single point of access, it is known as enterprise data warehousing.

Q2: Differentiate between a database and a data warehouse.
A2: A database holds a focused set of useful information and is small compared to a data warehouse. A data warehouse holds data of every kind, whether or not it is immediately useful, and data is extracted from it according to the customer’s requirements.

Q3: What do you understand by the term domain?
A3: A domain is the set of all interlinked relationships and nodes that are administered from a single organizational point.

Q4: Differentiate between a repository server and a powerhouse.
A4: The repository server mainly guarantees the reliability and consistency of the repository, while the powerhouse server handles the execution of the various processes among the components of the server’s database repository.

Q5: In Informatica Workflow Manager, how many repositories can be created?
A5: In general, any number of repositories can be created, although in practice it depends on the number of ports required.

Q6: What are the advantages of partitioning a session?
A6: The main advantage of partitioning a session is better server performance and efficiency. Another advantage is that each partition runs as an independent execution sequence within the session.

Q7: How can we create indexes after completing the load process?
A7: We can create indexes after the load process by using a command task at the session level.

Q8: Define sessions in Informatica ETL.
A8: A session is a set of instructions that must be executed to transform data from a source to a target.

Q9: What is meant by a domain?
A9: When all related relationships and nodes are covered by a single organizational point, it is called a domain. This arrangement makes data management easier.

Q10: What is the difference between a repository server and a powerhouse?
A10: The repository server controls the complete repository, which includes tables, charts, and various procedures. Its main function is to ensure repository integrity and consistency. A powerhouse server, by contrast, governs the execution of various processes among the components of the server’s database repository.

Q11: How many repositories can be created in Informatica?
A11: Any number of repositories can be created in Informatica, but in practice the number depends on the number of ports.

Q12: What is the benefit of partitioning a session?
A12: Partitioning a session means running independent execution sequences within the session. Its main purpose is to improve the server’s performance and efficiency. Transformations, extractions, and the other outputs of the individual partitions are carried out in parallel.

Q13: How are indexes created after completing the load process?
A13: Indexes can be created after the load process by using command tasks at the session level. Index creation scripts can be tied to the session’s workflow or to the post-session execution sequence. This type of index creation cannot be controlled at the transformation level after the load process.

Q14: Explain sessions. How are batches used to combine executions?
A14: A session is a set of instructions that must be executed to convert data from a source to a target. A session can be run using the session manager or the pmcmd command. Batch execution combines session executions in either a serial or a parallel manner, and a single batch can contain different sessions running in parallel or in series.
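
The difference between serial and parallel batch execution can be sketched in a few lines of Python. This is only an illustration of the ordering idea, not Informatica code: `run_session`, the session names, and the thread pool are all hypothetical stand-ins.

```python
from concurrent.futures import ThreadPoolExecutor


def run_session(name):
    """Hypothetical stand-in for one session run (source to target)."""
    return f"{name}: done"


def run_batch_serial(session_names):
    # Sessions execute one after another, in the order listed.
    return [run_session(name) for name in session_names]


def run_batch_parallel(session_names):
    # Sessions execute concurrently; map() still returns results in input order.
    with ThreadPoolExecutor(max_workers=len(session_names)) as pool:
        return list(pool.map(run_session, session_names))


# Hypothetical session names, used only for this sketch.
sessions = ["s_load_customers", "s_load_orders", "s_load_items"]
serial_results = run_batch_serial(sessions)
parallel_results = run_batch_parallel(sessions)
```

Both variants produce the same results here; what differs is whether the sessions wait for each other.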

Q15: How many sessions can be grouped in one batch?
A15: Any number of sessions can be grouped, but migration is easier when a batch contains fewer sessions.

Q16: Explain the difference between a mapping parameter and a mapping variable.
A16: A mapping variable is a value that changes during the session’s execution. When the session completes, the Informatica server stores the final value of the variable and reuses it when the session restarts. A mapping parameter is a value that does not change during the session’s execution. Mapping parameters and their usage are defined as part of the mapping, and values are assigned to them before the session starts.
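
A rough Python analogy may help separate the two ideas. In this sketch `min_amount` plays the role of a mapping parameter (fixed before the run) and the saved `last_id` plays the role of a mapping variable (changed during the run, stored at the end, reused on the next run). The JSON state file and all names are assumptions made for illustration; Informatica keeps such values in its repository.

```python
import json
import os
import tempfile


def run_session(state_path, rows, min_amount):
    """Load rows newer than the saved last_id whose amount is >= min_amount."""
    start_id = 0
    if os.path.exists(state_path):
        with open(state_path) as f:
            start_id = json.load(f)["last_id"]  # value kept from the previous run

    loaded = []
    max_id = start_id
    for row in rows:
        if row["id"] > start_id and row["amount"] >= min_amount:
            loaded.append(row)
        max_id = max(max_id, row["id"])  # the "variable" changes during the run

    with open(state_path, "w") as f:
        json.dump({"last_id": max_id}, f)  # final value is stored for the next run
    return loaded


state_path = os.path.join(tempfile.mkdtemp(), "mapping_variable.json")
rows = [{"id": 1, "amount": 50}, {"id": 2, "amount": 500}]
first_run = run_session(state_path, rows, min_amount=100)
rows.append({"id": 3, "amount": 700})
second_run = run_session(state_path, rows, min_amount=100)
```

The second run picks up only the row added after the first run, because the stored value carried over.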

Q17: What is an Expression transformation?
A17: It is used for performing non-aggregate calculations. It can also be used to test conditional statements before the output results move to the target tables.

Q18: Define Filter transformation.
A18: A Filter transformation filters rows in a mapping. All of its ports are input/output ports, and only the rows that meet the filter condition pass through it.

Q19: Define Joiner transformation.
A19: It joins two related heterogeneous sources that reside in different locations, whereas a Source Qualifier transformation can join data originating from a common source.

Q20: What do you mean by Lookup transformation?
A20: A Lookup transformation is used to look up data in a relational table from within a mapping. Multiple Lookup transformations can be used in a mapping.

Q21: How can we use the Union transformation?
A21: It is a multiple input group transformation that is used to combine data from different sources.

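Q22: Define incremental aggregation.
A22: Incremental aggregation can be used when a session is created for a mapping that contains an aggregate. Instead of recalculating everything, the session applies only the new or changed source data to the aggregate results saved from the previous run.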
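
The general idea behind incremental aggregation, applying only new rows to totals that were saved earlier, can be sketched as follows. This is an assumption-laden illustration in plain Python; the function name and the sample regions are hypothetical, and it does not model how Informatica stores its aggregate cache.

```python
def aggregate_incrementally(stored_totals, new_rows):
    """Apply only the new rows to previously stored per-group totals."""
    totals = dict(stored_totals)  # copy so the earlier result is left untouched
    for group, amount in new_rows:
        totals[group] = totals.get(group, 0) + amount
    return totals


# First run: nothing stored yet, so every row is aggregated.
run1 = aggregate_incrementally({}, [("east", 100), ("west", 40), ("east", 10)])
# Second run: only the rows that arrived since the first run are processed.
run2 = aggregate_incrementally(run1, [("west", 5), ("north", 7)])
```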

Q23: Define mapplet.
A23: A mapplet is a reusable object that is created using the Mapplet Designer.

Q24: What is a reusable transformation?
A24: A reusable transformation can be used many times across mappings. It is kept separate from the mappings that use it because it is stored as metadata.

Q25: Define update strategy.
A25: An update strategy is used whenever a row has to be updated or inserted based on some condition. The condition must be specified beforehand so that each processed row can be flagged for update or insert.
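
The flagging logic can be pictured with a small Python sketch. The `DD_*` names and their numeric values mirror the Informatica update strategy constants; the `flag_row` function, the `customer_id` field, and the rule it applies are hypothetical and chosen only for illustration.

```python
# Update strategy constants and their numeric equivalents.
DD_INSERT, DD_UPDATE, DD_DELETE, DD_REJECT = 0, 1, 2, 3


def flag_row(row, existing_keys):
    """Hypothetical rule: reject rows without a key, update known keys, insert the rest."""
    if row.get("customer_id") is None:
        return DD_REJECT
    if row["customer_id"] in existing_keys:
        return DD_UPDATE
    return DD_INSERT


existing_keys = {101}
rows = [{"customer_id": 101}, {"customer_id": 202}, {"customer_id": None}]
flags = [flag_row(row, existing_keys) for row in rows]
```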

Q26: Explain the scenario that compels the Informatica server to reject files.
A26: When the server encounters DD_REJECT in an Update Strategy transformation, it sends the row to the reject file.

Q27: What is a surrogate key?
A27: It is a substitute for the natural primary key. It is a unique identifier for each row in the table.
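
As a minimal sketch, a surrogate key can be generated from a simple running sequence that is independent of the natural key. The column names `customer_sk` and `customer_code` are hypothetical.

```python
from itertools import count


def assign_surrogate_keys(rows, start=1):
    """Attach a sequential surrogate key to each row, independent of its natural key."""
    next_key = count(start)
    return [{"customer_sk": next(next_key), **row} for row in rows]


# customer_code is the natural key; customer_sk is the generated surrogate key.
source_rows = [{"customer_code": "C-17"}, {"customer_code": "A-02"}]
keyed_rows = assign_surrogate_keys(source_rows)
```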

Q28: What are the prerequisite tasks for session partitioning?
A28: To partition a session, one needs to configure the session to partition the source data and then install the Informatica server on a machine with multiple CPUs.

Q29: Which files does the Informatica server create during a session run?
A29: The error log, bad file, workflow log, and session log are created during the session run.

Q30: What is meant by active and passive transformation?
A30: An active transformation is one that performs any of the following actions:

  • Changes the number of rows between transformation input and output. Example: Filter transformation.
  • Changes the transaction boundary by defining commit or rollback points. Example: Transaction Control transformation.
  • Changes the row type. Example: Update Strategy transformation, which is active because it flags rows for insert, delete, update, or reject.

A passive transformation, on the other hand, is one that does not change the number of rows that pass through it. Example: Expression transformation.
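
The row-count distinction can be shown with two tiny Python functions. They are hypothetical analogues, not Informatica transformations: one drops rows the way a filter does, the other only adds a calculated column the way an expression does.

```python
def filter_transformation(rows, predicate):
    """Active analogue: the output may contain fewer rows than the input."""
    return [row for row in rows if predicate(row)]


def expression_transformation(rows):
    """Passive analogue: one output row per input row, with a calculated column added."""
    return [{**row, "total": row["price"] * row["qty"]} for row in rows]


order_rows = [
    {"price": 10, "qty": 2},
    {"price": 5, "qty": 0},
    {"price": 3, "qty": 4},
]
filtered = filter_transformation(order_rows, lambda row: row["qty"] > 0)
calculated = expression_transformation(order_rows)
```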

Q31: What is the difference between Router and Filter?
A31: The following differences can be noted:

  • Router: divides the incoming records into multiple groups based on some condition. Such groups can be mutually inclusive (different groups may contain the same record).
    Filter: restricts or blocks the incoming record set based on one given condition.
  • Router: does not itself block any record. If a record does not match any of the routing conditions, it is routed to the default group.
    Filter: does not have a default group. If a record does not match the filter condition, the record is blocked.
  • Router: acts like a CASE .. WHEN statement in SQL (or a switch .. case statement in C).
    Filter: acts like a WHERE condition in SQL.
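
The contrast can be sketched in Python. This is only an analogy under assumed names (`filter_rows`, `route_rows`, the `LARGE` and `MEDIUM_UP` groups); it shows that a router keeps every record, lets one record land in several groups, and sends unmatched records to a default group, while a filter simply drops what does not match.

```python
def filter_rows(rows, condition):
    """Filter analogue: rows that fail the single condition are dropped."""
    return [row for row in rows if condition(row)]


def route_rows(rows, groups):
    """Router analogue: each row goes to every group whose condition it meets,
    and rows that match no group go to the default group."""
    routed = {name: [] for name in groups}
    routed["DEFAULT"] = []
    for row in rows:
        matched = False
        for name, condition in groups.items():
            if condition(row):
                routed[name].append(row)
                matched = True
        if not matched:
            routed["DEFAULT"].append(row)
    return routed


rows = [{"amount": 5}, {"amount": 50}, {"amount": 500}]
kept = filter_rows(rows, lambda r: r["amount"] >= 100)
routed = route_rows(rows, {
    "LARGE": lambda r: r["amount"] >= 100,
    "MEDIUM_UP": lambda r: r["amount"] >= 50,  # overlaps with LARGE on purpose
})
```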

Q32: What are the different lookup cache(s)?
A32: Informatica lookups can be cached or un-cached (no cache). A cached lookup can be either static or dynamic. A static cache is not modified once it is built and remains the same during the session run. A dynamic cache, on the other hand, is refreshed during the session run by inserting or updating records in the cache based on the incoming source data. By default, the Informatica cache is a static cache.

A lookup cache can also be classified as persistent or non-persistent, based on whether Informatica retains the cache after the session run completes or deletes it.
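
A small Python sketch can illustrate the static versus dynamic behaviour. The dictionary stands in for the lookup cache, and the function names and action labels are hypothetical.

```python
def lookup_static(cache, keys):
    """Static analogue: the cache is only read, never modified, during the run."""
    return [cache.get(key) for key in keys]


def lookup_dynamic(cache, incoming):
    """Dynamic analogue: the cache is inserted into or updated as rows arrive."""
    actions = []
    for key, value in incoming:
        if key not in cache:
            cache[key] = value
            actions.append("insert")
        elif cache[key] != value:
            cache[key] = value
            actions.append("update")
        else:
            actions.append("no change")
    return actions


static_cache = {1: "ann"}
static_hits = lookup_static(static_cache, [1, 3])

dynamic_cache = {1: "ann"}
actions = lookup_dynamic(dynamic_cache, [(1, "ann"), (2, "bob"), (1, "anne")])
```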

Q33: Under what conditions may selecting Sorted Input in an Aggregator fail the session?
A33: If the input data is not sorted correctly, the session will fail.

Even if the input data is properly sorted, the session may still fail if the sort-order ports and the group-by ports of the aggregator are not in the same order.
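
Why a streaming aggregator depends on sorted input can be seen with Python’s `itertools.groupby`, which, like an aggregator reading sorted input, only groups adjacent rows. This is an analogy rather than Informatica behaviour: Python silently produces split groups, whereas the answer above describes the session failing. The function name and sample data are assumptions.

```python
from itertools import groupby


def sum_by_group_sorted_input(rows):
    """Sum amounts per key, assuming rows arrive already sorted by key."""
    return [
        (key, sum(amount for _, amount in group))
        for key, group in groupby(rows, key=lambda row: row[0])
    ]


unsorted_rows = [("a", 1), ("b", 2), ("a", 3)]
# Unsorted input: group "a" is split in two because its rows are not adjacent.
split_result = sum_by_group_sorted_input(unsorted_rows)
# Sorting on the group-by key first gives one total per group.
correct_result = sum_by_group_sorted_input(sorted(unsorted_rows, key=lambda row: row[0]))
```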

Q34: Why is Sorter an Active Transformation?
A34: This is because we can select the “distinct” option in the Sorter properties.

When the Sorter transformation is configured to treat output rows as distinct, it assigns all ports as part of the sort key. The Integration Service discards duplicate rows compared during the sort operation. The number of output rows may therefore differ from the number of input rows, which makes it an active transformation.

Q35: What are the various types of transformation?
A35: The types of transformation include:

  • Aggregator transformation
  • Expression transformation
  • Filter transformation
  • Joiner transformation
  • Lookup transformation
  • Normalizer transformation
  • Rank transformation
  • Router transformation
  • Sequence Generator transformation
  • Stored procedure transformation
  • Sorter transformation
  • Update strategy transformation
  • XML source qualifier transformation

Q36: What is the difference between STOP and ABORT options in Workflow Monitor?
A36: On issuing the STOP command on the session task, the Integration Service stops reading data from the source, although it continues processing the data to the targets. If the Integration Service cannot finish processing and committing data, we can issue the ABORT command.

The ABORT command has a timeout period of 60 seconds. If the Integration Service cannot finish processing data within the timeout period, it kills the DTM process and terminates the session.

Q37: What are data-driven sessions?
A37: When you configure a session using an update strategy, the session property “Data Driven” instructs the Informatica server to use the instructions coded in the mapping to flag rows for insert, update, delete, or reject. This is done by specifying DD_UPDATE, DD_INSERT, DD_DELETE, or DD_REJECT in the Update Strategy transformation.

“Treat source rows as” property in session is set to “Data Driven” by default when using an update strategy transformation in a mapping.

Q38: What are mapplets?
A38: Mapplets can be described as follows:

  • A Mapplet is a reusable object that we create in the Mapplet Designer.
  • It contains a set of transformations and lets us reuse that transformation logic in multiple mappings.

Q39: How can we delete duplicate rows from flat files?
A39: We can use a Sorter transformation with the distinct option selected to remove the duplicate rows.
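
The effect of sorting with a distinct option can be approximated in Python as follows. The sketch treats every column as part of the sort key, as the distinct option is described as doing; the function name and the sample comma-separated data are hypothetical.

```python
import csv
import io


def distinct_sorted_rows(flat_file_text):
    """Read comma-separated rows, drop exact duplicates, and return them sorted."""
    reader = csv.reader(io.StringIO(flat_file_text))
    rows = [tuple(row) for row in reader if row]  # skip blank lines
    return sorted(set(rows))  # every column takes part in the comparison


flat_file_text = "2,bob\n1,ann\n2,bob\n"
unique_rows = distinct_sorted_rows(flat_file_text)
```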

