3.99 See Answer

Question: Describe and contrast the focus of data


Describe and contrast the focus of data mining and predictive analytics. Give some examples.


> What are the DBA's managerial roles? Describe the managerial activities and services provided by the DBA.

> Describe and characterize the skills desired for a DBA.

> Explain how the DBA plays an arbitration role for an organization's two main assets. Draw a diagram to facilitate your explanation.

> Explain and contrast the differences and similarities between the DBA and DA.

> Explain the DBA department's internal organization, based on the DBLC approach.

> Why and how are new technological advances in computers and databases changing the DBA's role?

> How can the DBA function be placed within the organization chart? What effect(s) will such placement have on the DBA function?

> Write a query to display the SKU (stock keeping unit), description, type, base, category, and price for all products that have a PROD_BASE of water and a PROD_CATEGORY of sealer. FIGURE P7. 45 WATER-BASED SEALERS PROD SKU PROD DESCRIPT PROD TYPE PR

> Describe the DBA's responsibilities.

> What special considerations must you take into account when introducing a DBMS into an organization?

> Explain the difference between data and information. Give some examples of raw data and information.

> What is ADO.NET, and what two new features make it important for application development?

> How does ADO complement OLE-DB?

> Explain the OLE-DB model based on its two types of objects.

> What is OLE-DB used for, and how does it differ from ODBC?

> What steps are required to create an ODBC data source name?

> What are the three basic components of the ODBC architecture?

> What is the difference between DAO and RDO?

> Write a query to display the eight departments in the LGDEPARTMENT table.

> Define SQL data services and list their advantages.

> Summarize the main advantages and disadvantages of cloud computing services.

> Using the Internet, search for providers of cloud services. Then, classify the types of services they provide (SaaS, PaaS, and IaaS).

> Name and describe the most prevalent characteristics of cloud computing services.

> Name and contrast the types of cloud computing implementation.

> What is cloud computing, and why is it a “game changer”?

> What are ODBC, DAO, and RDO? How are they related?

> What is a JDBC, and what is it used for?

> What are XML schema definition (XSD) documents and what do they do?

> What are document type definition (DTD) documents and what do they do?

> Using the results of the query created in Problem 42, find the total value of the product inventory. The results are shown in Figure P7.43. FIGURE P7.43 Total Value of All Products in Inventory Total Value of Inventory 15084.52

> What is XML, and why is it important?

> What are scripts, and what is their function? (Think in terms of database applications development!)

> What is a Web application server, and how does it work from a database perspective?

> What does this statement mean: The Web is a stateless system? What implications does a stateless system have for database applications developers?

> Search the Internet for Web application servers. Choose one and prepare a short presentation for your class.

> What are Web server interfaces used for? Give some examples.

> What is a DataSet, and why is it considered to be disconnected?

> Give some example of database connectivity options and what they are used for.

> What are the key assumptions made by the Hadoop Distributed File System approach?

> What is polyglot persistence, and why is it considered a new approach?

> Create a query to produce the summary of the value of products currently in inventory. Note that the value of each product is produced by the multiplication of the units currently in inventory and the unit price. Use the ORDER BY clause to match the orde

> Explain why veracity, value, and visualization can also be said to apply to relational databases as well as Big Data.

> How is stream processing different from feedback loop processing?

> What is stream processing, and why is it sometimes necessary?

> Explain the difference between scaling up and scaling out.

> Explain why companies like Google and Amazon were among the first to address the Big Data problem.

> Describe the characteristics of predictive analytics. What is the impact of Big Data in predictive analytics?

> How does data mining work? Discuss the different phases in the data mining process.

> What are the traditional 3 Vs of Big Data? Briefly, define each.

> What is data analytics? Briefly define explanatory and predictive analytics. Give some examples.

> Find the customer balance summary for all customers who have not made purchases during the current invoicing period. The results are shown in Figure P7.41. FIGURE P7.41 Summary of Customer Balances for Customers Who Did Not Make Purchases Minimum B

> Explain why graph databases tend to struggle with scaling out?

> What is the difference between a column and a super column in a column family database?

> Briefly explain the difference between row-centric and column-centric data storage.

> How are the value components of a key-value database and a document database different?

> What are the four basic categories of NoSQL databases?

> Briefly explain how HDFS and MapReduce are complementary to each other.

> Explain the basic steps of MapReduce processing.

> What is the difference between a name node and a data node in HDFS?

> What is Big Data? Give a brief definition.

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. Your data warehousing project group is debating whether to create a prototype of a data warehouse before its implementatio

> Find the listing of customers who did not make purchases during the invoicing period. Your output must match the output shown in Figure P7.40. FIGURE P7.40 Customer Balances for Customers Who Did Not Make Purchases CUS_CODE CUS_BALANCE 10010 0.00 1

> Trace the use of the transaction log in database recovery.

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. Prepare a high-level summary of the main requirements for evaluating DBMS products for data warehousing.

> Give three examples of likely problems when operational data are integrated into the data warehouse.

> What is a data warehouse, and what are its main characteristics? How does it differ from a data mart?

> What are the most relevant differences between operational and decision support data?

> Explain how the main components of the BI architecture interact to form a system. Describe the evolution of BI information dissemination formats.

> What are decision support systems, and what role do they play in the business environment?

> Discuss the most common performance improvement techniques used in star schemas.

> In the star schema context, what are attribute hierarchies and aggregation levels and what is their purpose?

> Describe the BI framework. Illustrate the evolution of BI.

> Explain multidimensional cubes, and describe how the slice and dice technique fits into this model.

> Create a query to find the customer balance characteristics for all customers, including the total of the outstanding balances. The results of this query are shown in Figure P7.39. FIGURE P7.39 Customer Balance Summary for All Customers Total Balan

> Explain the use of facts, dimensions, and attributes in the star schema.

> Explain ROLAP, and list the reasons you would recommend its use in the relational database environment.

> What is OLAP, and what are its main characteristics?

> Briefly discuss OLAP architectural styles with and without data marts.

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. The data warehouse project is in the design phase. Explain to your fellow designers how you would use a star schema in the

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. The project group is ready to make a final decision between ROLAP and MOLAP. What should be the basis for this decision? W

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. One of your vendors recommends using an MDBMS. How would you explain this recommendation to your project leader?

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. The data warehousing project group has invited you to provide an OLAP overview. The group’s members are particularly conce

> While working as a database analyst for a national sales organization, you are asked to be part of its data warehouse project team. Suppose you are selling the data warehouse idea to your users. How would you define multidimensional data analysis for the

> What is business intelligence? Give some recent examples of BI usage, using the Internet for assistance. What BI benefits have companies found?

> Using the results of the query created in Problem 37, provide a summary of customer balance characteristics as shown in Figure P7.38. FIGURE P7.38 Balance Summary for Customers Who Made Purchases Minimum Balance Maximum Balance Average Balance 345.

> How does a BASE system differ from a traditional distributed database system?

> What trade-offs are involved in building highly distributed data environments?

> What are the two basic styles of data replication?

> What issues should be considered when resolving data requests in a distributed environment?

> To which transparency feature are the query optimization functions related?

> What is the objective of the query optimization functions?

> Describe the different types of database requests and transactions.

> If indexes are so important, why not index every column in every table? (Include a brief discussion of the role played by data sparsity.)

> In simple terms, the DBMS processes queries in three phases. What are those phases, and what is accomplished in each phase?

> How is the processing of SQL DDL statements (such as CREATE TABLE) different from the processing required by DML statements?

> List the balance characteristics of the customers who have made purchases during the current invoice cycle—that is, for the customers who appear in the INVOICE table. The results of this query are shown in Figure P7.37. FIGURE P7.37 Ba

> What database statistics measurements are typical of tables, indexes, and resources?

> How are database statistics obtained?

> What are database statistics, and why are they important?

> What is the focus of most performance tuning activities, and why does that focus exist?

3.99

See Answer